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1 
CORONAVIRUS 


FIELD OF THE INVENTION 


The present invention relates to an attenuated coronavirus 
comprising a variant replicase gene, which causes the virus 
to have reduced pathogenicity. The present invention also 
relates to the use of such a coronavirus in a vaccine to 
prevent and/or treat a disease. 


BACKGROUND TO THE INVENTION 


Avian infectious bronchitis virus (IBV), the aetiological 
agent of infectious bronchitis (IB), is a highly infectious and 
contagious pathogen of domestic fowl that replicates pri- 
marily in the respiratory tract but also in epithelial cells of 
the gut, kidney and oviduct. IBV is a member of the Order 
Nidovirales, Family Coronaviridae, Subfamily Corona viri- 
nae and Genus Gammacoronavirus; genetically very similar 
coronaviruses cause disease in turkeys, guinea fowl and 
pheasants. 

Clinical signs of IB include sneezing, tracheal rales, nasal 
discharge and wheezing. Meat-type birds have reduced 
weight gain, whilst egg-laying birds lay fewer eggs and 
produce poor quality eggs. The respiratory infection predis- 
poses chickens to secondary bacterial infections which can 
be fatal in chicks. The virus can also cause permanent 
damage to the oviduct, especially in chicks, leading to 
reduced egg production and quality; and kidney, sometimes 
leading to kidney disease which can be fatal. 

IBV has been reported to be responsible for more eco- 
nomic loss to the poultry industry than any other infectious 
disease. Although live attenuated vaccines and inactivated 
vaccines are universally used in the control of IBV, the 
protection gained by use of vaccination can be lost either due 
to vaccine breakdown or the introduction of a new IBV 
serotype that is not related to the vaccine used, posing a risk 
to the poultry industry. 

Further, there is a need in the industry to develop vaccines 
which are suitable for use in ovo, in order to improve the 
efficiency and cost-effectiveness of vaccination pro- 
grammes. A major challenge associated with in ovo vacci- 
nation is that the virus must be capable of replicating in the 
presence of maternally-derived antibodies against the virus, 
without being pathogenic to the embryo. Current IBV vac- 
cines are derived following multiple passage in embryonated 
eggs, this results in viruses with reduced pathogenicity for 
chickens, so that they can be used as live attenuated vac- 
cines. However such viruses almost always show an 
increased virulence to embryos and therefore cannot be used 
for in ova vaccination as they cause reduced hatchability. A 
70% reduction in hatchability is seen in some cases. 

Attenuation following multiple passage in embryonated 
eggs also suffers from other disadvantages. It is an empirical 
method, as attenuation of the viruses is random and will 
differ every time the virus is passaged, so passage of the 
same virus through a different series of eggs for attenuation 
purposes will lead to a different set of mutations leading to 
attenuation. There are also efficacy problems associated with 
the process: some mutations will affect the replication of the 
virus and some of the mutations may make the virus too 
attenuated. Mutations can also occur in the S gene which 
may also affect immunogenicity so that the desired immune 
response is affected and the potential vaccine may not 
protect against the required serotype. In addition there are 
problems associated with reversion to virulence and stability 
of vaccines. 
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It is important that new and safer vaccines are developed 
for the control of IBV. Thus there is a need for IBV vaccines 
which are not associated with these issues, in particular 
vaccines which may be used for in ovo vaccination. 


SUMMARY OF ASPECTS OF THE INVENTION 


The present inventors have used a reverse genetics 
approach in order to rationally attenuate IBV. This approach 
is much more controllable than random attenuation follow- 
ing multiple passages in embryonated eggs because the 
position of each mutation is known and its effect on the 
virus, i.e. the reason for attenuation, can be derived. 

Using their reverse genetics approach, the present inven- 
tors have identified various mutations which cause the virus 
to have reduced levels of pathogenicity. The levels of 
pathogenicity may be reduced such that when the virus is 
administered to an embryonated egg, it is capable of repli- 
cating without being pathogenic to the embryo. Such viruses 
may be suitable for in ovo vaccination, which is a significant 
advantage and has improvement over attenuated IBV vac- 
cines produced following multiple passage in embryonated 
eggs. 

Thus in a first aspect, the present invention provides a 
live, attenuated coronavirus comprising a variant replicase 
gene encoding polyproteins comprising a mutation in one or 
more of non-structural protein(s) (nsp)-10, nsp-14, nsp-15 or 
nsp-16. 

The variant replicase gene may encode a protein com- 
prising one or more amino acid mutations selected from the 
list of: 

Pro to Leu at position 85 of SEQ ID NO: 6, 

Val to Leu at position 393 of SEQ ID NO: 7; 

Leu to Ile at position 183 of SEQ ID NO: 8; 

Val to Ile at position 209 of SEQ ID NO: 9. 

The replicase gene may encode a protein comprising the 
amino acid mutation Pro to Leu at position 85 of SEQ ID 
NO: 6. 

The replicase gene may encode a protein comprising the 
amino acid mutations Val to Leu at position 393 of SEQ ID 
NO: 7; Leu to Ile at position 183 of SEQ ID NO: 8; and Val 
to Ile at position 209 of SEQ ID NO: 9. 

The replicase gene may encodes a protein comprising the 
amino acid mutations Pro to Leu at position 85 of SEQ ID 
NO: 6; Val to Leu at position 393 of SEQ ID NO:7; Leu to 
Ile at position 183 of SEQ ID NO:8; and Val to Ile at position 
209 of SEQ ID NO: 9. 

The replicase gene may comprise one or more nucleotide 
substitutions selected from the list of: 

C to T at nucleotide position 12137; 

G to C at nucleotide position 18114; 

T to A at nucleotide position 19047; and 

G to A at nucleotide position 20139; 

compared to the sequence shown as SEQ ID NO: 1. 

The coronavirus may be an infectious bronchitis virus 
(IBV). 

The coronavirus may be IBV M41. 

The coronavirus may comprise an S protein at least part 
of which is from an IBV serotype other than M41. 

For example, the S1 subunit or the entire S protein may 
be from an IBV serotype other than M41. 

The coronavirus according to the first aspect of the 
invention has reduced pathogenicity compared to a corona- 
virus expressing a corresponding wild-type replicase, such 
that when the virus is administered to an embryonated egg, 
it is capable of replicating without being pathogenic to the 
embryo. 


US 10,130,701 B2 


3 


In a second aspect, the present invention provides a 
variant replicase gene as defined in connection with the first 
aspect of the invention. 

In a third aspect, the present invention provides a protein 
encoded by a variant coronavirus replicase gene according 
to the second aspect of the invention. 

In a fourth aspect, the present invention provides a 
plasmid comprising a replicase gene according to the second 
aspect of the invention. 

In a fifth aspect, the present invention provides a method 
for making the coronavirus according to the first aspect of 
the invention which comprises the following steps: 

(1) transfecting a plasmid according to the fourth aspect of 

the invention into a host cell; 

(ii) infecting the host cell with a recombining virus 
comprising the genome of a coronavirus strain with a 
replicase gene; 

(iii) allowing homologous recombination to occur 
between the replicase gene sequences in the plasmid 
and the corresponding sequences in the recombining 
virus genome to produce a modified replicase gene; and 

(iv) selecting for recombining virus comprising the modi- 
fied replicase gene. 

The recombining virus may be a vaccinia virus. 

The method may also include the step: 

(v) recovering recombinant coronavirus comprising the 
modified replicase gene from the DNA from the recom- 
bining virus from step (iv). 

In a sixth aspect, the present invention provides a cell 
capable of producing a coronavirus according to the first 
aspect of the invention. 

In a seventh aspect, the present invention provides a 
vaccine comprising a coronavirus according to the first 
aspect of the invention and a pharmaceutically acceptable 
carrier. 

In an eighth aspect, the present invention provides a 
method for treating and/or preventing a disease in a subject 
which comprises the step of administering a vaccine accord- 
ing to the seventh aspect of the invention to the subject. 

Further aspects of the invention provide: 

the vaccine according to the seventh aspect of the inven- 
tion for use in treating and/or preventing a disease in a 
subject. 

use of a coronavirus according to the first aspect of the 
invention in the manufacture of a vaccine for treating 
and/or preventing a disease in a subject. 

The disease may be infectious bronchitis (IB). 

The method of administration of the vaccine may be 
selected from the group consisting of; eye drop administra- 
tion, intranasal administration, drinking water administra- 
tion, post-hatch injection and in ovo injection. 

Vaccination may be by in ova vaccination. 

The present invention also provides a method for produc- 
ing a vaccine according to the seventh aspect of the inven- 
tion, which comprises the step of infecting a cell according 
to the sixth aspect of the invention with a coronavirus 
according to the first aspect of the invention. 


DESCRIPTION OF THE FIGURES 


FIG. 1—Growth kinetics of M41-R-6 and M41-R-12 
compared to M41-CK (M41 EP4) on CK cells 

FIG. 2—Clinical signs, snicking and wheezing, associ- 
ated with M41-R-6 and M41-R-12 compared to M41-CK 
(M41 EP4) and Beau-R (Bars show mock, Beau-R, M41-R 
6, M41-R 12, M41-CK EP4 from left to right of each 
timepoint). 
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FIG. 3—Ciliary activity of the viruses in tracheal rings 
isolated from tracheas taken from infected chicks. 100% 
ciliary activity indicates no effect by the virus; apathogenic, 
0% activity indicates complete loss of ciliary activity, com- 
plete ciliostasis, indicating the virus is pathogenic (Bars 
show mock, Beau-R, M41-R 6, M41-R 12, M41-CK EP4 
from left to right of each timepoint). 

FIG. 4—Clinical signs, snicking, associated with M41R- 
nsp10rep and M41R-nsp14,15,16rep compared to M41-R- 
12 and M41-CK (M41 EPS) (Bars show mock, M41-R12; 
M41R-nsp10rep; M41R-nsp14,15,16rep and M41-CK EPS5 
from left to right of each timepoint). 

FIG. 5—Ciliary activity of M41R-nsp10rep and M41R- 
nsp14,15,16rep compared to M41-R-12 and M41-CK in 
tracheal rings isolated from tracheas taken from infected 
chicks (Bars show mock; M41-R12; M41R-nsp10rep; 
M41R-nsp14,15,16rep and M41-CK EPS from left to right 
of each timepoint). 

FIG. 6—Clinical signs, snicking, associated with M41R- 
nsp1l0, 15rep, M41R-nsp10, 14, 15rep, M41R-nsp10, 14, 
l6rep, M41R-nsp10, 15, 16rep and M41-K compared to 
M41-CK (Bars show mock, M41R-nsp10,15rep1; M41R- 
nsp10,14,16rep4; M41R-nsp10,15,16rep8; M41R-nsp10,14, 
15rep10; M41-K6 and M41-CK EP4 from left to right of 
each timepoint). 

FIG. 7—Clinical signs, wheezing, associated with M41R- 
nsp10, 15rep, M41R-nsp10, 14, 15rep, M41R-nsp10, 14, 
l6rep, M41R-nsp10, 15, 16rep and M41-K compared to 
M41-CK (Bars show mock, M41R-nsp10,15rep1; M14R- 
nsp10,14,16rep4; M41R-nsp10,15,16rep8; M41R-nsp10,14, 
15rep10; M41-K6 and M41-CK EP4 from left to right of 
each timepoint). 

FIG. 8—Ciliary activity of M41R-nsp10, 15rep, M41R- 
nsp10, 14, 15rep, M41R-nsp10, 14, l6rep, M41R-nsp10, 15, 
l6rep and M41-K compared to M41-CK in tracheal rings 
isolated from tracheas taken from infected chicks (Bars 
show mock, M41R-nsp10,15rep1; M41R-nsp10,14,16rep4; 
M41R-nsp10,15,16rep8; M41R-nsp10,14,15rep10; M41- 
K6 and M41-CK EP4 from left to right of each timepoint). 

FIG. 9—Growth kinetics of rIBVs compared to M41-CK 
on CK cells. FIG. 9A shows the results for M41-R and 
M41-K. FIG. 9B shows the results for M41-nsp10 rep; 
M41R-nsp14, 15, 16 rep; M41R-nsp10, 15 rep; M41R- 
nsp10, 15, 16 rep; M41R-nsp10, 14, 15 rep; and M41R- 
nsp10, 14, 16. 

FIG. 10—Position of amino acid mutations in mutated 
nsp10, nsp14, nsp15 and nsp16 sequences. 

FIG. 11—A) Snicking; B) Respiratory symptoms (wheez- 
ing and rales combined) and C) Ciliary activity of rIBV 
M41R-nsp 10,14 rep and rIBV M41R-nsp 10,16 rep com- 
pared to M41-CK (Bars show mock, M41R-nsp10,14rep; 
M41R-nsp10,16rep and M41-K from left to right of each 
timepoint). 


DETAILED DESCRIPTION 


The present invention provides a coronavirus comprising 
a variant replicase gene which, when expressed in the 
coronavirus, causes the virus to have reduced pathogenicity 
compared to a corresponding coronavirus which comprises 
the wild-type replicase gene. 

Coronavirus 

Gammacoronavirus is a genus of animal virus belonging 
to the family Coronaviridae. Coronaviruses are enveloped 
viruses with a positive-sense single-stranded RNA genome 
and a helical symmetry. 
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The genomic size of coronaviruses ranges from approxi- 
mately 27 to 32 kilobases, which is the longest size for any 
known RNA virus. 

Coronaviruses primarily infect the upper respiratory or 
gastrointestinal tract of mammals and birds. Five to six 
different currently known strains of coronaviruses infect 
humans. The most publicized human coronavirus, SARS- 
CoV which causes severe acute respiratory syndrome 
(SARS), has a unique pathogenesis because it causes both 
upper and lower respiratory tract infections and can also 
cause gastroenteritis. Middle East respiratory syndrome 
coronavirus (MERS-CoV) also causes a lower respiratory 
tract infection in humans. Coronaviruses are believed to 
cause a significant percentage of all common colds in human 
adults. 

Coronaviruses also cause a range of diseases in livestock 
animals and domesticated pets, some of which can be 
serious and are a threat to the farming industry. Economi- 
cally significant coronaviruses of livestock animals include 
infectious bronchitis virus (IBV) which mainly causes respi- 
ratory disease in chickens and seriously affects the poultry 
industry worldwide; porcine coronavirus (transmissible gas- 
troenteritis, TGE) and bovine coronavirus, which both result 
in diarrhoea in young animals. Feline coronavirus has two 
forms, feline enteric coronavirus is a pathogen of minor 
clinical significance, but spontaneous mutation of this virus 
can result in feline infectious peritonitis (FIP), a disease 
associated with high mortality. 

There are also two types of canine coronavirus (CCoV), 
one that causes mild gastrointestinal disease and one that has 
been found to cause respiratory disease. Mouse hepatitis 
virus (MHV) is a coronavirus that causes an epidemic 
murine illness with high mortality, especially among colo- 
nies of laboratory mice. 

Coronaviruses are divided into four groups, as shown 
below: 

Alpha 

Canine coronavirus (CCoV) 

Feline coronavirus (FeCoV) 

Human coronavirus 229E (HCoV-229EF) 

Porcine epidemic diarrhoea virus (PEDV) 

Transmissible gastroenteritis virus (TGEV) 

Human Coronavirus NL63 (NL or New Haven) 

Beta 

Bovine coronavirus (BCoV) 

Canine respiratory coronavirus (CRCoV)—Common 
in SE Asia and Micronesia 

Human coronavirus OC43 (HCoV-O0C43) 

Mouse hepatitis virus (MHV) 

Porcine haemagglutinating encephalomyelitis virus 
(HEV) 

Rat coronavirus (Roy). Rat Coronavirus is quite preva- 
lent in Eastern Australia where, as of March/April 
2008, it has been found among native and feral 
rodent colonies. 

(No common name as of yet) (HCoV-HKU1) 

Severe acute respiratory syndrome coronavirus 
(SARS-CoV) 

Middle East respiratory syndrome coronavirus (MERS- 
CoV) 

Gamma 

Infectious bronchitis virus (IBV) 

Turkey coronavirus (Bluecomb disease virus) 

Pheasant coronavirus 

Guinea fowl coronavirus 
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6 
Delta 
Bulbul coronavirus (BuCoV) 
Thrush coronavirus (ThCoV) 
Munia coronavirus (MuCoV) 
Porcine coronavirus (PorCov) HKU15 

The variant replicase gene of the coronavirus of the 
present invention may be derived from an alphacoronavirus 
such as TGEV; a betacoronavirus such as MHV; or a 
gammacoronavirus such as IBV. 

As used herein the term “derived from” means that the 
replicase gene comprises substantially the same nucleotide 
sequence as the wild-type replicase gene of the relevant 
coronavirus. For example, the variant replicase gene of the 
present invention may have up to 80%, 85%, 90%, 95%, 
98% or 99% identity with the wild type replicase sequence. 
The variant coronavirus replicase gene encodes a protein 
comprising a mutation in one or more of non-structural 
protein (nsp)-10, nsp-14, nsp-15 or nsp-16 when compared 
to the wild-type sequence of the non-structural protein. 

IBV 

Avian infectious bronchitis (IB) is an acute and highly 
contagious respiratory disease of chickens which causes 
significant economic losses. The disease is characterized by 
respiratory signs including gasping, coughing, sneezing, 
tracheal rales, and nasal discharge. In young chickens, 
severe respiratory distress may occur. In layers, respiratory 
distress, nephritis, decrease in egg production, and loss of 
internal egg quality and egg shell quality are common. 

In broilers, coughing and rattling are common clinical 
signs, rapidly spreading in all the birds of the premises. 
Morbidity is 100% in non-vaccinated flocks. Mortality var- 
ies depending on age, virus strain, and secondary infections 
but may be up to 60% in non-vaccinated flocks. 

The first IBV serotype to be identified was Massachusetts, 
but in the United States several serotypes, including Arkan- 
sas and Delaware, are currently circulating, in addition to the 
originally identified Massachusetts type. 

The IBV strain Beaudette was derived following at least 
150 passages in chick embryos. IBV Beaudette is no longer 
pathogenic for hatched chickens but rapidly kills embryos. 

H120 is a commercial live attenuated IBV Massachusetts 
serotype vaccine strain, attenuated by approximately 120 
passages in embryonated chicken eggs. H52 is another 
Massachusetts vaccine, and represents an earlier and slightly 
more pathogenic passage virus (passage 52) during the 
development of H120. Vaccines based on H120 are com- 
monly used. 

IB QX is a virulent field isolate of IBV. It is sometimes 
known as “Chinese QX” as it was originally isolated fol- 
lowing outbreaks of disease in the Qingdao region in China 
in the mid 1990s. Since that time the virus has crept towards 
Europe. From 2004, severe egg production issues have been 
identified with a very similar virus in parts of Western 
Europe, predominantly in the Netherlands, but also reported 
from Germany, France, Belgium, Denmark and in the UK. 

The virus isolated from the Dutch cases was identified by 
the Dutch Research Institute at Deventer as a new strain that 
they called D388. The Chinese connection came from fur- 
ther tests which showed that the virus was 99% similar to the 
Chinese QX viruses. A live attenuated QX-like IBV vaccine 
strain has now been developed. 

IBV is an enveloped virus that replicates in the cell 
cytoplasm and contains an non-segmented, single-stranded, 
positive sense RNA genome. IBV has a 27.6 kb RNA 
genome and like all coronaviruses contains the four struc- 
tural proteins; spike glycoprotein (S), small membrane pro- 
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tein (E), integral membrane protein (M) and nucleocapsid 
protein (N) which interacts with the genomic RNA. 


The genome is organised in the following manner: 
S5'UTR—polymerase (replicase) gene—structural protein 
genes (S-E-M-N)—UTR 3'; where the UTR are untranslated 
regions (each ~500 nucleotides in IBV). 


The lipid envelope contains three membrane proteins: S, 
M and E. The IBV S protein is a type I glycoprotein which 
oligomerizes in the endoplasmic reticulum and is assembled 
into homotrimer inserted in the virion membrane via the 
transmembrane domain and is associated through non-co- 
valent interactions with the M protein. Following incorpo- 
ration into coronavirus particles, the S protein is responsible 
for binding to the target cell receptor and fusion of the viral 
and cellular membranes. The S glycoprotein consists of four 
domains: a signal sequence that is cleaved during synthesis; 
the ectodomain, which is present on the outside of the virion 
particle; the transmembrane region responsible for anchor- 
ing the S protein into the lipid bilayer of the virion particle; 
and the cytoplasmic tail. 


All coronaviruses also encode a set of accessory protein 
genes of unknown function that are not required for repli- 
cation in vitro, but may play a role in pathogenesis. IBV 
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encodes two accessory genes, genes 3 and 5, which both 
express two accessory proteins 3a, 3b and 5a, 5b, respec- 
tively. 

The variant replicase gene of the coronavirus of the 
present invention may be derived from an IBV. For example 
the IBV may be IBV Beaudette, H120, H52, IB QX, D388 
or M41. 

The IBV may be IBV M41. M41 is a prototypic Massa- 
chusetts serotype that was isolated in the USA in 1941. It is 
an isolate used in many labs throughout the world as a 
pathogenic lab stain and can be obtained from ATCC (VR- 
21™). Attenuated variants are also used by several vaccine 
producers as IBV vaccines against Massachusetts serotypes 
causing problems in the field. The present inventors chose to 
use this strain as they had worked for many years on this 
virus, and because the sequence of the complete virus 
genome is available. The M41 isolate, M41-CK, used by the 
present inventors was adapted to grow in primary chick 
kidney (CK) cells and was therefore deemed amenable for 
recovery as an infectious virus from a cDNA of the complete 
genome. It is representative of a pathogenic IBV and there- 
fore can be analysed for mutations that cause either loss or 
reduction in pathogenicity. 

The genome sequence of IBV M41-CK is provided as 
SEQ ID NO: 1. 


IBV M41-CK Sequence 
SEQ ID NO: 1 
ACTTAAGATAGATATTAATATATATCTATCACACTAGCCTTGCGCTAGATTTCCAACTTA 
ACAAAACGGACT TAAATACCTACAGCTGGTCCTCATAGGTGTTCCATTGCAGTGCACTTT 
AGTGCCCTGGATGGCACCTGGCCACCTGTCAGGTTTTTGTTATTAAAATCTTATTGTTGC 
TGGTATCACTGCTTGTTTTGCCGTGTCTCACTTTATACATCCGTTGCTTGGGCTACCTAG 
TATCCAGCGTCCTACGGGCGCCGTGGCTGGTT CGAGTGCGAAGAACCTCTGGTTCATCTA 
GCGGTAGGCGGGTGTGTGGAAGTAGCACTTCAGACGTACCGGTTCTGTTGTGTGAAATAC 
GGGGTCACCTCCCCCCACATACCTCTAAGGGCTTTTGAGCCTAGCGTTGGGCTACGTTCT 
CGCATAAGGTCGGCTATACGACGTT TGTAGGGGGTAGTGC CAAACAACCCCTGAGGTGAC 
AGGTTCTGGTGGTGTT TAGTGAGCAGACATACAATAGACAGTGACAACATGGCTTCAAGC 
CTAAAACAGGGAGTATCTGCGAAACTAAGGGATGTCATTGTTGTATCCAAAGAGATTGCT 
GAACAACTTTGTGACGCTTTGTTTTTCTATACGTCACACAACCCTAAGGATTACGCTGAT 
GCTTTTGCAGTTAGGCAGAAGTTTGATCGTAATCTGCAGACTGGGAAACAGTTCAAATTT 
GAAACTGTGTGTGGTCTCTTCCTCTTGAAGGGAGTTGACAAAATAACACCTGGCGTCCCA 
GCAAAAGT CT TAAAAGCCACTTCTAAGTTGGCAGATTTAGAAGACATCTTTGGTGTCTCT 
CCCTTTGCAAGAAAATATCGTGAACTT TTGAAGACAGCATGCCAGTGGTCTCTTACTGTA 
GAAACACTGGATGCTCGTGCACAAACTCTTGATGAAATTTTTGACCCTACTGAAATACTT 
TGGCTT CAGGTGGCAGCAAAAAT CCAAGTTTCGGCTATGGCGATGCGCAGGCTTGTTGGA 
GAAGTAACTGCAAAAGTCATGGATGCTTTGGGCTCAAATATGAGTGCTCTTTTCCAGATT 


TTTAAACAACAAATAGTCAGAATTTTTCAAAAAGCGCTGGCTATTTTTGAGAATGTGAGT 


GAATTACCACAGCGTATTGCAGCACTTAAGATGGCTTTTGCTAAGTGTGCCAAGTCCATT 


ACTGTTGTGGTTATGGAGAGGACTCTAGTTGT TAGAGAGT TCGCAGGAACTTGTCTTGCA 


AGCATTAATGGTGCTGTTGCAAAATTCTTTGAAGAACTCCCAAATGGTTTCATGGGTGCT 


AAAATTTTCACTACACTTGCCTTCTTTAGGGAGGCTGCAGTGAAAAT TGTGGATAACATA 


CCAAATGCACCGAGAGGCACTAAAGGGTT TGAAGTCGT TGGTAATGCCAAAGGTACACAA 
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-continued 
GTTGTTGTGCGTGGCATGGGAAATGACTTAACACTGGTTGAGCAAAAAGCTGAAATTGCT 


GTGGAGTCAGAAGGTTGGT CTGCAATTTTGGGTGGACATCTTTGCTATGTCTTTAAGAGT 
GGTGATCGCTTTTACGCGGCACCTCTTTCAGGAAAT TT TGCATTGCATGATGTGCATTGT 
TGTGAGCGTGTTGTCTGTCTTTCTGATGGTGTAACACCGGAGATAAATGATGGACTTATT 
CTTGCAGCAATCTACTCTTCTTTTAGTGTCGCAGAACT TGTGGCAGCCATTAAAAGGGGT 
GAACCATTTAAGTTTCTGGGTCATAAATTTGTGTATGCAAAGGATGCAGCAGTTTCTTTT 
ACATTAGCGAAGGCTGCTACTATTGCAGATGTTTTGAAGCTGTTTCAATCAGCGCGTGTG 
AAAGTAGAAGATGTTTGGTCTTCACTTACTGAAAAGTCTTTTGAATTCTGGAGGCTTGCA 
TATGGAAAAGTGCGTAATCTCGAAGAATTTGTTAAGACTTGTTTTTGTAAGGCTCAAATG 
GCGATTGTGATTTTAGCGACAGTGCTTGGAGAGGGCATTTGGCATCTTGTTTCGCAAGTC 
ATCTATAAAGTAGGTGGTCTTTTTACTAAAGTTGTTGACTTTTGTGAAAAATATTGGAAA 
GGTTTTTGTGCACAGTTGAAAAGAGCTAAGCT CATTGTCACTGAAACCCTCTGTGTTTTG 
AAAGGAGTTGCACAGCATTGTTTTCAACTATTGCTGGATGCAATACAGTTTATGTATAAA 
AGTTTTAAGAAGTGTGCACTTGGTAGAATCCATGGAGACTTGCTCTTCTGGAAAGGAGGT 
GTGCACAAAATTATTCAAGAGGGCGATGAAAT TTGGTT TGAGGGCATTGATAGTATTGAT 
GTTGAAGATCTGGGTGT TGTTCAAGAAAAATTGATTGATTTTGATGTTTGTGATAATGTG 
ACACTTCCAGAGAACCAACCCGGTCATATGGT TCAAAT CGAGGATGACGGAAAGAACTAC 
ATGTTCTTCCGCTT CAAAAAGGATGAGAACATTTATTATACACCAATGTCACAGCTTGGT 
GCTATTAATGTGGTTTGCAAAGCAGGCGGTAAAACTGTCACCTTTGGAGAAACTACTGTG 
CAAGAAATACCACCACCTGATGTTGTGTTTATTAAGGT TAGCATTGAGTGTTGTGGTGAA 
CCATGGAATACAAT CTT CAAAAAGGCT TATAAGGAGCCCATTGAAGTAGAGACAGACCTC 
ACAGTTGAACAATTGCTCTCTGTGGTCTATGAGAAAATGTGTGATGATCTCAAGCTGTTT 
CCGGAGGCTCCAGAACCACCACCATTTGAGAATGTCACACTTGTTGATAAGAATGGTAAA 
GATT TGGATTGCATAAAATCATGCCATCTGATCTATCGTGATTATGAGAGCGATGATGAC 
ATCGAGGAAGAAGATGCAGAAGAATGTGACACGGAT TCAGGTGATGCTGAGGAGTGTGAC 
ACTAAT TCAGAATGTGAAGAAGAAGATGAGGATACTAAAGTGTTGGCTCTTATACAAGAC 
CCGGCAAGTAACAAATATCCTCTGCCTCTTGATGATGATTATAGCGTCTACAATGGATGT 
ATTGTTCATAAGGACGCTCTCGATGTTGTGAATT TACCATCTGGTGAAGAAACCTTTGTT 
GTCAATAACTGCTT TGAAGGGGCTGTTAAAGCTCTTCCGCAGAAAGTTATTGATGTTCTA 
GGTGACTGGGGTGAGGCTGTTGATGCGCAAGAACAATTGTGTCAACAAGAATCAACTCGG 
GTCATATCTGAGAAATCAGTTGAGGGT TTTACTGGTAGTTGTGATGCAATGGCTGAACAA 
GCTATTGT TGAAGAGCAGGAAATAGTACCTGT TGTTGAACAAAGT CAGGATGTAGTTGTT 
TTTACACCTGCAGACCTAGAAGT TGTTAAAGAAACAGCAGAAGAGGT TGATGAGTTTATT 
CTCATTTCTGCTGTCCCTAAAGAAGAAGTTGTGT CT CAGGAGAAAGAGGAGCCACAGGTT 
GAGCAAGAGCCTACCCTAGTTGT TAAAGCACAACGT GAGAAGAAGGC TAAAAAGTT CAAA 


GTTAAACCAGCTACATGTGAAAAACCCAAATT TT TGGAGTACAAAACATGTGTGGGTGAT 


TTGGCTGTTGTAAT TGCCAAAGCAT TGGATGAGT TTAAAGAGT TCTGCATTGTAAACGCT 


GCAAATGAGCACATGTCGCATGGTGGTGGCGT TGCAAAGGCAATTGCAGACTTTTGTGGA 


CCGGACTTTGTTGAATATTGCGCGGAC TAT GT TAAGAAACATGGT CCACAGCAAAAACTT 


GTCACACCTTCATTTGT TAAAGGCATT CAATGTGTGAATAATGTTGTAGGACCTCGCCAT 


GGAGACAGCAACTTGCGTGAGAAGCTTGTTGCTGCT TACAAGAGTGTTCTTGTAGGTGGA 
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-continued 


GTGGTTAACTATGTTGTGCCAGTTCTCTCATCAGGGATTTTTGGTGTAGATTTTAAAATA 
TCAATAGATGCTATGCGCGAAGCTTTTAAAGGTTGTGCCATACGCGTTCTTTTATTTTCT 
CTGAGT CAAGAACACATCGATTATTTCGATGCAACT TGTAAGCAGAAGACAATTTATCTT 
ACGGAGGATGGTGT TAAATACCGCTCTGTTGTTTTAAAACCTGGTGATTCTTTGGGTCAA 


TTTGGACAGGTTTT TGCAAGAAATAAGGTAGTCTTTTCGGCTGATGATGTTGAGGATAAA 


GAAATCCTCTTTATACCCACAACTGACAAGACTATTCTTGAATATTATGGTTTAGATGCG 


CAAAAGTATGTAACATATT TGCAAACGCTTGCGCAGARATGGGATGT TCAATATAGAGAC 


AATTTTGTTATATTAGAGTGGCGTGACGGAAATTGCTGGATTAGTTCAGCAATAGTTCTC 


CTTCAAGCTGCTAAAAT TAGATT TAAAGGT TT TCTTGCAGAAGCATGGGCTAAACTGTTG 


GGTGGAGATCCTACAGACTTTGTTGCCTGGTGTTATGCAAGTTGCAATGCTAAAGTAGGT 


GATTTTTCAGATGCTAATTGGCTTTTGGCCAATT TAGCAGAACATTTTGACGCAGATTAC 


ACAAATGCACTTCTTAAGAAGTGTGTGTCGTGCAATTGTGGTGTTAAGAGTTATGAACTT 


AGGGGTCTTGAAGCCTGTATTCAGCCAGTTCGAGCACCTAATCTTCTACATTTTAAAACG 


CAATATTCAAATTGCCCAACCTGTGGTGCAAGTAGTACGGATGAAGTAATAGAAGCTTCA 


TTACCGTACTTATTGCTTTTTGCTACTGATGGTCCTGCTACAGTTGATTGTGATGAAAAT 


GCTGTAGGGACTGTTGTTTTCATTGGCTCTACTAATAGTGGCCATTGTTATACACAAGCC 


GATGGTAAGGCTTTTGACAAT CT TGCTAAGGATAGAAAAT TTGGAAGGAAGTCGCCTTAC 


ATTACAGCAATGTATACACGTTTTTCTCTTAGGAGTGAAAATCCCCTACTTGTTGTTGAA 


CATAGTAAGGGTAAAGC TAAAGTAGTAAAAGAAGATGTTTCTAACCTTGCTACTAGTTCT 


AAAGCCAGTTTTGACGATCTTACTGACTTTGAACACTGGTATGATAGCAACATCTATGAG 


AGTCTTAAAGTGCAGGAGACACCTGATAATCTTGATGAATATGTGTCATTTACGACAAAG 


GAAGATTCTAAGTTGCCACTGACACTTAAAGT TAGAGGTATCAAATCAGTTGTTGACTTT 


AGGTCTAAGGATGGTTTTACTTATAAGTTAACACCTGATACTGATGAAAATTCAAAAACA 


CCAGTCTACTACCCAGTCTTGGATTCTATTAGTCTTAGGGCAATATGGGTTGAAGGCAGT 


GCTAATTTTGTTGTTGGGCATCCAAAT TAT TATAGTAAGTCTCTCCGAATTCCCACGTTT 


TGGGAAAATGCCGAGAGCTTTGT TAAAATGGGTTATAAAATTGATGGTGTAACTATGGGC 


CTTTGGCGTGCAGAACACCTTAATAAACCTAATT TGGAGAGAATTTTTAACATTGCTAAG 


AAAGCTATTGTTGGATCTAGTGTTGTTACTACGCAGTGTGGTAAAATACTAGT TAAAGCA 


GCTACATACGTTGCCGATAAAGTAGGTGATGGTGTAGT TCGCAATATTACAGATAGAATT 


AAGGGTCTTTGTGGATT CACACGTGGCCAT TT TGAAAAGAAAATGTCCCTACAATTTCTA 


AAGACACTTGTGTTCTTTTTCTTTTATTTCTTAAAGGCTAGTGCTAAGAGTTTAGTTTCT 


AGCTATAAGATTGTGTTATGTAAGGTGGTGTTTGCTACCTTACTTATAGTGTGGTTTATA 


TACACAAGTAAT CCAGTAGTGTT TACTGGAATACGTGTGCTAGACTTCCTATTTGAAGGT 


TCTTTATGTGGTCCTTATAATGACTACGGTAAAGATTCTTTTGATGTGTTACGGTATTGT 


GCAGGTGATTTTACTTGTCGTGTGTGTTTACATGATAGAGATTCACTTCATCTGTACAAA 


CATGCTTATAGCGTAGAACAAAT TTATAAGGATGCAGCTTCTGGCATTAACTTTAATTGG 


AATTGGCTTTATTTGGTCTTTCTAATATTATTTGTTAAGCCAGTGGCAGGTTTTGTTATT 


ATTTGTTATTGTGTTAAGTATTTGGTATTGAGTT CAACTGTGT TGCAAACTGGTGTAGGT 


TTTCTAGATTGGTT TGTAAAAACAGTTTTTACCCATTTTAATTTTATGGGAGCGGGATTT 


TATTTCTGGCTCTT TTACAAGATATACGTACAAGTGCATCATATATTGTACTGTAAGGAT 
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-continued 
GTAACATGTGAAGTGTGCAAGAGAGTTGCACGCAGCAACAGGCAAGAGGTTAGCGTTGTA 


GTTGGTGGACGCAAGCAAATAGTGCATGTTTACACTAATTCTGGCTATAACTTTTGTAAG 
AGACATAATTGGTATTGTAGAAATTGTGATGATTATGGTCACCAAAATACATTTATGTCC 
CCTGAAGTTGCTGGCGAGCTTTCTGAAAAGCT TAAGCGCCATGTTAAACCTACAGCATAT 


GCTTACCACGTTGTGTATGAGGCATGCGTGGT TGATGATTTTGTTAATT TAAAATATAAG 


GCTGCAAT TGCTGGTAAGGATAATGCATCTTCTGCTGTTAAGTGTTTCAGTGTTACAGAT 


TTTTTAAAGAAAGCTGTTTTTCT TAAGGAGGCAT TGAAATGTGAACAAATATCTAATGAT 


GGTTTTATAGTGTGTAATACACAGAGTGCGCATGCACTAGAGGAAGCAAAGAATGCAGCC 


GTCTATTATGCGCAATATCTGTGTAAGCCAATACTTATACTTGACCAGGCACTTTATGAG 


CAAT TAATAGTAGAGCCTGTGTCTAAGAGTGT TATAGATAAAGTGTGTAGCATTTTGTCT 


AATATAATATCTGTAGATACTGCAGCTTTAAATTATAAGGCAGGCACACTTCGTGATGCT 


CTGCTTTCTATTACTAAAGACGAAGAAGCCGTAGATATGGCTATCTTCTGCCACAATCAT 


GAAGTGGAATACACTGGTGACGGTTTTACTAATGTGATACCGTCATATGGTATGGACACT 


GATAAGTTGACACCTCGTGATAGAGGGTTTTTGATAAATGCAGATGCTTCTATTGCTAAT 


TTAAGAGTCAAAAATGCTCCTCCGGTAGTATGGAAGTTTTCTGATCTTATTAAATTGTCT 


GACAGTTGCCTTAAATATTTAATTTCAGCTACTGTCAAGTCAGGAGGTCGTTTCTTTATA 


ACAAAGTCTGGTGCTAAACAAGTTATTTCTTGTCATACCCAGAAACTGT TGGTAGAGAAA 


AAGGCAGGTGGTGT TAT TAATAACACTTTTAAATGGTTTATGAGTTGTTTTAAATGGCTT 


TTTGTCTTTTATATACTTTTTACAGCATGTTGTTTGGGTTACTACTATATGGAGATGAAT 


AAAAGTTTTGTTCACCCCATGTATGATGTAAACTCCACACTGCATGTTGAAGGGTTCAAA 


GTTATAGACAAAGGTGT TATTAGAGAGATTGTGTCAGAAGATAATTGTTTCTCTAATAAG 


TTTGTTAATTTTGACGCCTTT TGGGGTAAATCATATGAAAATAATAAAAACTGTCCAATT 


GTTACAGT TGTTATAGATGGTGACGGGACAGTAGCTGTTGGTGTTCCTGGTTTTGTATCA 


TGGGTTATGGATGGTGTTATGTTTGTGCATATGACACAGACTGATCGTAGACCTTGGTAC 


ATTCCTACCTGGTT TAATAGAGAAATTGTTGGTTACACTCAGGATTCAATTATCACTGAG 


GGTAGTTTTTATACATCTATAGCATTATTTTCTGCTAGATGTTTATATTTAACAGCCAGC 


AATACACCTCAATTGTATTGTTT TAATGGCGACAATGATGCACCTGGAGCCTTACCATTT 


GGTAGTATTATTCCTCATAGAGTATACTTCCAACCTAATGGTGTTAGGCTTATAGTTCCA 


CAACAAATACTGCATACACCCTACATAGTGAAGT TTGTTTCAGACAGCTATTGTAGAGGT 


AGTGTATGTGAGTATACTAAACCAGGT TACTGTGTGTCACTAGACTCCCAATGGGTTTTG 


TTTAATGATGAATACAT TAGTAAACCTGGCGTTTTCTGTGGTTCTACTGTTAGAGAACTT 


ATGTTTAATATGGTTAGTACATTCTTTACTGGTGTCAACCCTAATATTTATATTCAGCTA 


GCAACTATGTTTTTAATACTAGTTGTTATTGTGTTAATTTTTGCAATGGTTATAAAGTTT 


CAAGGTGTTTTTAAAGCTTATGCGACCATTGTGTTTACAATAATGTTAGTTTGGGTTATT 


AATGCATTTGTTTTGTGTGTACATAGT TATAATAGTGTTTTAGCTGTTATATTATTAGTA 


CTCTATTGCTATGCATCATTGGT TACAAGT CGCAATACTGCTATAATAATGCATTGTTGG 


CTTGTTTTTACCTTTGGTT TAATAGTACCCACATGGTTGGCTTGTTGCTATCTGGGATTT 


ATTCTTTATATGTACACACCGTTGGTTTTCTGGTGTTACGGTACTACTAAAAATACTCGT 


AAGTTGTATGATGGCAACGAGTTTGTTGGTAATTATGACCTTGCTGCGAAGAGCACTTTT 


GTTATTCGTGGTACTGAAT TTGT TAAGCTTACGAAT GAGATAGGTGATAAATT TGAAGCC 


TATCTTTCTGCGTATGCTAGACT TAAATACTATT CAGGCACTGGTAGTGAGCAAGATTAC 
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-continued 


TTGCAAGCTTGTCGTGCATGGTTAGCT TATGCTT TGGACCAATATAGAAATAGTGGTGTT 
GAGGTTGTTTATACCCCACCGCGTTACTCTATTGGTGT TAGTAGACTACACGCTGGTTTT 
AAAAAACTAGTTTCTCCTAGTAGTGCTGTTGAGAAGTGCATTGTTAGTGTCTCTTATAGA 
GGCAATAATCTTAATGGACTGTGGCTGGGTGATTCTATTTACTGCCCACGCCATGTGTTA 


GGTAAGTT TAGTGGTGACCAGTGGGGTGACGTACTAAACCTTGCTAATAATCATGAGTTT 


GAAGTTGTAACT CAAAATGGTGT TACTTTGAATGTTGTCAGCAGGCGGCTTAAAGGAGCA 


GTTTTAATTTTACAAACTGCAGT TGCCAATGCTGAAACTCCTAAGTATAAGTTTGTTAAA 


GCTAATTGTGGTGATAGTTTCACTATAGCTTGTTCT TATGGTGGTACAGTTATAGGACTT 


TACCCTGTCACTATGCGTTCTAATGGTACTATTAGAGCATCTTTCCTAGCAGGAGCCTGT 


GGCTCAGTTGGTTT TAATATAGAAAAGGGTGTAGTTAATTTCTTTTATATGCACCATCTT 


GAGTTACCTAATGCATTACACACTGGAACTGACCTAATGGGTGAGTTTTATGGTGGTTAT 


GTAGATGAAGAGGT TGCGCAAAGAGTGCCACCAGATAATCTAGTTACTAACAATATTGTA 


GCATGGCTCTATGGGGCAATTAT TAGTGTTAAAGAAAGTAGTTTTTCACAACCTAAATGG 


TTGGAGAGTACTACTGTTTCTATTGAAGAT TACAATAGGTGGGCTAGTGATAATGGTTTT 


ACTCCATTTTCCACTAGTACTGCTATTACTAAAT TAAGTGCTATAACTGGGGTTGATGTT 


TGTAAACTCCTTCGCACTATTATGGTAAAAAGTGCTCAATGGGGTAGTGATCCCATTTTA 


GGACAATATAAT TT TGAAGACGAAT TGACACCAGAATCTGTATTTAATCAAGTTGGTGGT 


GTTAGGTTACAGTCTTCTTTTGTAAGAAAAGCTACATCTTGGTTTTGGAGTAGATGTGTA 


TTAGCTTGCTTCTTGTTTGTGTTGTGTGCTATTGTCTTATTTACGGCAGTGCCACTTAAG 


TTTTATGTACATGCAGCTGTTATTTTGTTGATGGCTGTGCTCTTTATTTCTTTTACTGTT 


AAACATGTTATGGCATACATGGACACTTTCCTATTGCCTACATTGATTACAGTTATTATT 


GGAGTTTGTGCTGAAGTCCCTTTCATATACAATACTCTAATTAGTCAAGTTGTTATTTTC 


TTAAGCCAATGGTATGATCCTGTAGTCTTTGATACTATGGTACCATGGATGTTATTGCCA 


TTAGTGTTGTACACTGCTTTTAAGTGTGTACAAGGCTGCTATATGAATTCTTTCAATACT 


TCTTTGTTAATGCTGTATCAGTTTATGAAGTTAGGTTTTGTTATTTACACCTCTTGAAAC 


ACTCTTACTGCATATACAGAAGGTAAT TGGGAGTTATTCTTTGAGTTGGTTCACACTATT 


GTGTTGGCTAATGTTAGTAGTAATTCCTTAATTGGTTTAATTGTTTTTAAGTGTGCTAAG 


TGGATTTTATATTATTGCAATGCAACATACTT TAATAATTATGTGTTAATGGCAGTCATG 


GTTAATGGCATAGGCTGGCTTTGCACCTGTTACTTTGGATTGTATTGGTGGGT TAATAAA 


GTTTTTGGTTTAACCTTAGGTAAATACAATTT TAAAGT TT CAGTAGATCAATATAGGTAT 


ATGTGTTTGCATAAGGTAAAT CCACCTAAAACTGTGTGGGAGGTCTT TACTACAAATATA 


CTTATACAAGGAAT TGGAGGCGATCGTGTGTTGCCTATAGCTACAGTGCAATCTAAATTG 


AGTGATGTAAAGTGTACAACTGTTGTTTTAATGCAGCTTTTGACTAAGCTTAATGTTGAA 


GCAAAT TCAAAAATGCATGCTTATCTTGTTGAGT TACACAATAAAATCCTCGCATCTGAT 


GATGTTGGAGAGTGCATGGATAATT TATTGGGTATGCT TATAACACTATTTTGTATAGAT 


TCTACTATTGATTTGGGTGAGTATTGTGATGATATACT TAAGAGGTCAACTGTATTACAA 


TCGGTTACTCAAGAGTTTTCGCACATACCCTCGTATGCTGAATATGAAAGAGCTAAGAGT 


ATTTATGAAAAGGT TTTAGCCGATT CTAAAAATGGTGGTGTAACACAGCAAGAGCTTGCT 


GCATATCGTAAAGCTGCCAATAT TGCAAAGTCAGTT TT TGATAGAGACTTGGCTGTTCAA 


AAGAAGTTAGATAGCATGGCAGAACGTGCTATGACAACAATGTATAAAGAGGCGCGTGTA 
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ACTGATAGAAGAGCAAAATTAGTTTCATCATTACATGCACTACTTTTTTCAATGCTTAAG 


AAAATAGATTCTGAGAAGCTTAATGTCTTATTTGACCAGGCGAATAGTGGTGTTGTACCC 
CTAGCAACTGTTCCAATTGTTTGTAGTAATAAGCTTACCCTTGTTATACCAGACCCAGAG 
ACGTGGGT CAAGTGTGTGGAGGGTGTGCATGT TACATATTCAACAGTTGTTTGGAATATA 
GACTGTGTTACTGATGCCGATGGCACAGAGTTACACCCCACTTCTACAGGTAGTGGATTG 
ACTTACTGTATAAGTGGTGATAATATAGCATGGCCTTTAAAGGTTAACTTGACTAGGAAT 


GGGCATAATAAGGTTGATGTTGCCTTGCAAAATAATGAGCTTATGCCTCACGGTGTAAAG 


ACAAAGGCTTGCGTAGCAGGTGTAGATCAAGCACATTGTAGCGTTGAGTCTAAATGTTAT 


TATACAAGTATTAGTGGCAGTTCAGTTGTAGCTGCTATTACCTCTTCAAATCCTAATCTG 


AAAGTAGCCTCTTTTTTGAATGAGGCAGGTAATCAGATTTATGTAGACTTAGACCGAGCA 


TGTAAATT TGGTATGAAAGTGGGTGATAAGGT TGAAGTTGTTTACCTGTATTTTATAAAA 


AATACGAGGTCTATTGTAAGAGGTATGGTACTTGGTGCTATATCTAATGTTGTTGTGTTA 


CAATCTAAAGGT CATGAGACAGAGGAAGTGGATGCTGTAGGCATTCTCTCACTTTGTTCT 


TTTGCAGTAGAT CCTGCGGATACATAT TGTAAATATGTGGCAGCAGGTAATCAACCTTTA 


GGTAACTGTGTTAAAATGT TGACAGTACATAATGGTAGTGGTTTTGCAATAACATCAAAG 


CCAAGTCCAACTCCGGATCAGGATTCTTATGGAGGAGCTTCTGTGTGTCTTTATTGTAGA 


GCACATATAGCACACCCTGGCGGAGCAGGAAATT TAGATGGACGCTGTCAATTTAAAGGT 


TCTTTTGTGCAAATACCTACTACGGAGAAAGATCCTGTTGGATTCTGTCTACGTAACAAG 


GTTTGCACTGTTTGTCAGTGTTGGATTGGT TATGGATGTCAGTGTGATTCACTTAGACAA 


CCTAAACCTTCTGTTCAGTCAGTTGCTGTTGCATCTGGTTTTGATAAGAATTATTTAAAC 


GGGTACGGGGTAGCAGTGAGGCT CGGCTGATACCCCTAGCTAATGGATGTGACCCCGATG 


TTGTAAAGCGAGCCTTTGATGTT TGTAATAAGGAAT CAGCCGGTATGTTTCAAAATTTGA 


AGCGTAACTGTGCACGATT CCAAGAAGTACGTGATACTGAAGATGGAAATCTTGAGTATT 


GTGATTCTTATTTTGTGGT TAAACAAACCACTCCTAGTAATTATGAACATGAGAAAGCTT 


GTTATGAAGACT TAAAGTCAGAAGTAACAGCTGATCATGATTTCTTTGTGTTCAATAAGA 


ACATTTATAATATTAGTAGGCAGAGGCTTACTAAGTATACTATGATGGATTTTTGCTATG 


CTTTGCGGCACTTTGACCCAAAGGATTGCGAAGT TCTTAAAGAAATACTTGTCACTTATG 


GTTGTATAGAAGAT TAT CACCCTAAGT GGT TTGAAGAGAATAAGGAT TGGTACGACCCAA 


TAGAAAACCCTAAATAT TATGCCATGT TGGCTAAAATGGGACCTATTGTACGAGGTGCTT 


TATTGAATGCTATTGAGTTCGGAAACCTCATGGT TGAAAAAGGTTATGTTGGTGTTATTA 


CACTTGATAACCAAGATCTTAATGGCAAATTTTATGATTTTGGTGATTTTCAGAAGACAG 


CGCCTGGTGCTGGTGTTCCTGTTTTTGATACGTATTATTCTTACATGATGCCCATCATAG 


CCATGACTGATGCGTTGGCACCTGAGAGGTAT TT TGAATATGATGTGCATAAGGGTTATA 


AATCTTATGATCTCCTCAAGTATGATTATACTGAGGAGAAACAAGATTTGTTTCAGAAGT 


ACTTTAAGTATTGGGAT CAAGAGTATCACCCTAACTGT CGCGACTGTAGTGATGACAGGT 


GTTTGATACATTGTGCAAACTTCAACATCTTGTTTTCTACACTTGTACCGCAGACTTCTT 


TCGGTAATTTGTGTAGAAAGGTTTTTGTTGATGGTGTACCATTTATAGCTACTTGTGGCT 


ATCATTCTAAGGAACTTGGTGTTAT TATGAAT CAAGATAACACCATGTCATTTTCAAAAA 


TGGGTT TGAGTGAACTCATGGAGTT TGTTGGAGATCGTGGCTTGTTAGTGGGGACATGCA 


ATAAAT TAGTGGATCTTAGAACGTCTTGTTTTAGTGTTTGTGCTTTAGCGTCTGGTATTA 


CTCATCAAACGGTAAAACCAGGT CACT TTAACAAGGAT TT CTACGAT TT TGCAGAGAAGG 
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CTGGTATGTT TAAGGAAGGTTCTTCTATACCACT TAAACATTTCTTCTACCCACAGACTG 
GTAATGCTGCTATAAACGATTATGATTATTATCGTTATAACAGGCCTACCATGTTTGATA 
TACGTCAACTTTTATTTTGTT TAGAAGTGACTTCTAAATATTTTGAATGTTATGAAGGCG 
GCTGTATACCAGCAAGCCAAGTTGTAGTTAACAATT TAGATAAGAGTGCAGGTTATCCGT 


TCAATAAGTT TGGAAAGGCCCGTCTCTATTATGAAATGAGTCTAGAGGAGCAGGAC CAAC 


TCTTTGAGAGTACAAAGAAGAACGTCCTGCCTACTATAACTCAGATGAATT TAAAATATG 


CCATATCCGCGAAAAATAGAGCGCGTACAGTGGCAGGTGTGTCTATCCTTTCTACTATGA 


CTAATAGGCAGTTTCATCAGAAGATTCTTAAGTCTATAGT CAACACTAGAAACGCTCCTG 


TAGT TATTGGAACAACCAAGT TT TATGGCGGT TGGGATAACATGT TGAGAAACCTTATTC 


AGGGTGTTGAAGACCCGATTCTTATGGGTTGGGATTAT CCAAAGTGTGATAGAGCAATGC 


CTAATTTGTTGCGTATAGCAGCATCTT TAGTACTCGCTCGTAAACACACTAATTGTTGTA 


CTTGGTCTGAACGCGTT TATAGGTTGTATAATGAATGCGCTCAGGTTTTATCTGAAACTG 


TCTTAGCTACAGGTGGTATATATGTGAAACCTGGTGGTACTAGCAGTGGAGATGCTACTA 


CTGCTTATGCAAACAGTGTTTTCAACATAATACAAGCCACATCTGCTAATGTTGCGCGTC 


TTTTGAGTGT TATAACGCGTGATAT TGTATATGATGACAT TAAGAGCTTGCAGTATGAAT 


TGTACCAGCAGGTT TATAGGCGAGTCAATTTTGACCCAGCATTTGTTGAAAAGTTTTATT 


CTTATTTGTGTAAGAATTTCTCATTGATGATCTTGTCTGACGACGGTGTTGTTTGTTATA 


ACAACACATTAGCCAAACAAGGT CT TGTAGCAGATATTTCTGGTTTTAGAGAAGTTCTCT 


ACTATCAGAACAATGTTTTTATGGCTGATTCTAAATGTTGGGT TGAACCAGATTTAGAAA 


AAGGCCCACATGAATTTTGTTCACAGCACACAATGT TAGTGGAGGTTGATGGTGAGCCTA 


GATACTTGCCATATCCAGACCCATCACGTATTTTGTGTGCATGTGTTTTTGTAGATGATT 


TGGATAAGACAGAATCTGTGGCTGT TATGGAGCGTTATATCGCTCTTGCCATAGATGCGT 


ACCCACTAGTACAT CATGAAAATGAGGAGTACAAGAAGGTATTCTTTGTGCTTCTTTCAT 


ACATCAGAAAACTCTATCAAGAGCTTTCTCAGAATATGCTTATGGACTACTCTTTTGTAA 


TGGATATAGATAAGGGTAGTAAATT TTGGGAACAGGAGTTCTATGAAAATATGTATAGAG 


CCCCTACAACAT TACAGTGTTGTGGCGTTTGTGTAGTGTGTAATAGTCAAACTATATTGC 


GCTGTGGTAATTGTATTCGCAAACCATTTTTGTGTTGTAAGTGTTGCTATGACCATGTCA 


TGCACACAGACCACAAAAATGTTTTGTCTATAAATCCTTACATTTGCTCACAGCCAGGTT 


GTGGTGAAGCAGATGTTACTAAATTGTACCTCGGAGGTATGTCATACTTCTGCGGTAATC 


ATAAACCAAAGT TATCAATACCGTTAGTAT CTAATGGTACAGTGTTTGGAATTTACAGGG 


CTAATTGTGCAGGTAGCGAAAATGTTGATGATTT TAATCAACTAGCTACTACTAATTGGT 


CTACTGTGGAACCTTATATTTTGGCAAATCGTTGTGTAGATTCGTTGAGACGCTTTGCTG 


CAGAGACAGTAAAAGCTACAGAAGAAT TACATAAGCAACAATT TGCTAGTGCAGAAGTGA 


GAGAAGTACTCTCAGATCGTGAATTGATTCTGTCTTGGGAGCCAGGTAAAACCAGGCCTC 


CATTGAATAGAAATTATGTTTTCACTGGCTTTCACTTTACTAGAACTAGTAAAGTT CAGC 


TCGGTGATTT TACATTTGAAAAAGGTGAAGGTAAGGACGTTGTCTATTATCGAGCGACGT 


CTACTGCTAAATTGTCTGTTGGAGACATTTTTGTTTTAACCTCACACAATGTTGTTTCTC 


TTATAGCGCCAACGTTGTGTCCTCAGCAAACCTTTTCTAGGTTTGTGAATTTAAGACCTA 


ATGTGATGGTACCTGCGTGTT TTGTAAATAACAT TCCATTGTACCATTTAGTAGGCAAGC 


AGAAGCGTACTACAGTACAAGGCCCTCCTGGCAGTGGTAAATCCCATTTTGCTATAGGAT 
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TGGCGGCTTACTTTAGTAACGCCCGTGTCGTTTTTACTGCATGCTCTCATGCAGCTGTTG 


ATGCTTTATGTGAAAAAGCTTTTAAGTTTCTTAAAGTAGATGATTGCACTCGTATAGTAC 
CTCAAAGGACTACTATCGATTGCTTCTCTAAGTT TAAAGGTAATGACACAGGCAAAAAGT 
ACATTTTTAGTACTATTAATGCCTTGCCAGAAGT TAGTTGTGACATTCTTTTGGTTGACG 
AGGT TAGTATGT TGACCAATTACGAATTGTCTTTTATTAATGGTAAGATAAACTATCAAT 
ATGTTGTGTATGTAGGTGATCCTGCTCAAT TACCGGCGCCTCGTACGTTGCTTAACGGTT 
CACTCTCTCCAAAGGAT TATAATGTTGTCACAAACCTTATGGTTTGTGTTAAACCTGACA 
TTTTCCTTGCAAAGTGT TACCGTTGTCCTAAAGAAATTGTAGATACTGTTTCTACTCTTG 
TATATGATGGAAAGTTTAT TGCAAATAACCCGGAATCACGTCAGTGTTTCAAGGTTATAG 
TTAATAATGGTAAT TCTGATGTAGGACATGAAAGTGGCTCAGCCTACAACATAACTCAAT 
TAGAATTTGTGAAAGAT TT TGTCTGTCGCAATAAGGAATGGCGGGAAGCAACATTCATTT 
CACCTTATAATGCTATGAACCAGAGAGCCTACCGTATGCTTGGACTTAATGTTCAGACAG 
TAGACTCGTCTCAAGGT TCGGAGTATGATTATGTTATCTTTTGTGTTACTGCAGATTCGC 
AGCATGCACTGAATATTAACAGATT CAATGTAGCGCTTACAAGAGCCAAGCGTGGTATAC 
TAGTTGTCATGCGTCAGCGTGATGAACTATATTCAGCTCTTAAGTTTATAGAGCTTGATA 
GTGTAGCAAGTCTGCAAGGTACAGGCTTGTTTAAAATT TGCAACAAAGAGTTTAGTGGTG 
TTCACCCAGCTTATGCAGTCACAACTAAGGCT CT TGCTGCAACTTATAAAGTTAATGATG 
AACTTGCTGCACTTGTTAACGTGGAAGCTGGT TCAGAAATAACATATAAACATCTTATTT 
CTTTGTTAGGGT TTAAGATGAGTGT TAATGTTGAAGGCTGCCACAACATGTTTATAACAC 
GTGATGAGGCTATCCGCAACGTAAGAGGTTGGGTAGGT TT TGATGTAGAAGCAACACATG 
CTTGCGGTACTAACATTGGTACTAACCTGCCTTTCCAAGTAGGTTTCTCTACTGGTGCAG 
ACTTTGTAGTTACGCCTGAGGGACT TGTAGATACTTCAATAGGCAATAATTTTGAGCCTG 
TGAATTCTAAAGCACCTCCAGGTGAACAATTTAATCACTTGAGAGCGTTATTCAAAAGTG 
CTAAACCTTGGCATGTTGTAAGGCCAAGGATTGTGCAAATGTTAGCGGATAACCTGTGCA 
ACGTTTCAGATTGTGTAGTGTTTGTCACGTGGTGTCATGGCCTAGAACTAACCACTTTGC 
GCTATTTTGT TAAAATAGGCAAGGACCAAGTTTGTTCTTGCGGTTCTAGAGCAACAACTT 
TTAATTCTCATACTCAGGCTTATGCTTGTTGGAAGCATTGCTTGGGTTTTGATTTTGTTT 
ATAATCCACTCTTAGTGGATATT CAACAGTGGGGTTATTCTGGTAACCTACAATTTAACC 
ATGATT TGCATTGTAATGTGCATGGACACGCACATGTAGCTTCTGCGGATGCTATTATGA 
CGCGTTGTCT TGCAATTAATAATGCATTTTGTCAAGATGT CAACTGGGATT TAACTTACC 
CTCATATAGCAAATGAGGATGAAGT CAATT CTAGCTGTAGATATT TACAACGCATGTATC 
TTAATGCATGTGTTGATGCTCTTAAAGTTAACGTTGTCTATGATATAGGCAACCCTAAAG 
GTATAAAATGTGTTAGACGTGGAGACT TAAAT TT TAGATTCTATGATAAGAATCCAATAG 
TACCCAATGT CAAGCAGTT TGAGTATGACTATAATCAGCACAAAGATAAGTTTGCTGATG 
GTCTTTGTATGTTTTGGAATTGTAATGTGGATTGTTATCCCGACAATTCCTTAGTTTGTA 
GGTACGACACACGAAAT TTGAGTGTGTTTAACCTACCTGGTTGTAATGGTGGTAGCTTGT 
ATGT TAACAAGCATGCATTCCACACACCTAAATTTGAT CGCACTAGCTTTCGTAATTTGA 
AAGCTATGCCATTCTTTTTCTATGACTCATCGCCTTGCGAGACCATT CAAT TGGATGGAG 
TTGCGCAAGACCTTGTGTCAT TAGCTACGAAAGATTGTAT CACAAAATGCAACATAGGCG 


GTGCTGTT TGTAAAAAGCACGCACAAATGTATGCAGATTTTGTGACTTCTTATAATGCAG 


CTGTTACTGCTGGTTTTACTTTTTGGGTTACTAATAATTTTAACCCATATAATTTGTGGA 
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AAAGTTTTTCAGCTCTCCAGT CTATCGACAATATTGCT TATAATATGTATAAGGGTGGTC 
ATTATGATGCTATTGCAGGAGAAATGCCCACTAT CGTAACTGGAGATAAAGTTTTTGTTA 
TAGATCAAGGCGTAGAAAAAGCAGTTTTTTTTAATCAAACAATTCTGCCTAGATCTGTAG 
CGTTTGAGCTGTATGCGAAGAGAAATATTCGCACACTGCCAAACAACCGTATTTTGAAAG 
GTTTGGGTGTAGATGTGACTAATGGATTTGTAATTTGGGATTACACGAACCAAACACCAC 


TATACCGTAATACTGTTAAGGTATGTGCATATACAGACATAGAACCAAATGGCCTAATAG 


TGCTGTATGATGATAGATATGGTGATTACCAGTCTTTTCTAGCTGCTGATAATGCTGTTT 


TAGTTTCTACACAGTGT TACAAGCGGTATT CGTATGTAGAAATACCGTCAAACCTGCTTG 


TTCAGAACGGTATT CCGTTAAAAGATGGAGCGAACCTGTATGTTTATAAGCGTGTTAATG 


GTGCGTTTGTTACGCTACCTAACACAT TAAACACACAGGGTCGCAGTTATGAAACTTTTG 


AACCTCGTAGTGATGTTGAGCGTGATTTTCTCGACATGTCTGAGGAGAGTTTTGTAGAAA 


AGTATGGTAAAGAATTAGGTCTACAGCACATACTGTATGGTGAAGTTGATAAGCCCCAAT 


TAGGTGGT TTACACACTGT TATAGGTATGTGCAGACTTTTACGTGCGAATAAGTTGAACG 


CAAAGTCTGTTACTAATTCTGATTCTGATGTCATGCAAAATTATTTTGTATTGGCAGACA 


ATGGTTCCTACAAGCAAGTGTGTACTGTTGTGGATTTGCTGCTTGATGATTTCTTAGAAC 


TTCTTAGGAACATACTGAAAGAGTATGGTACTAATAAGTCTAAAGTTGTAACAGTGTCAA 


TTGATTACCATAGCATAAATTTTATGACTTGGTT TGAAGATGGCATTATTAAAACATGTT 


ATCCACAGCTTCAATCAGCATGGACGTGTGGT TATAATATGCCTGAACTTTATAAAGTTC 


AGAATTGTGTTATGGAACCTTGCAACATTCCTAATTATGGTGT TGGAATAGCGTTGCCAA 


GTGGTATTATGATGAATGTGGCAAAGTATACACAACTCTGTCAATACCTTTCGAAAACAA 


CAATGTGTGTACCGCATAATATGCGAGTAATGCATTTTGGAGCTGGAAGTGACAAAGGAG 


TGGCTCCAGGTAGTACTGTTCTTAAACAATGGCTCCCAGAAGGGACACTCCTTGTCGATA 


ATGATATTGTAGACTATGTGTCTGATGCACATGTTTCTGTGCTTTCAGATTGCAATAAAT 


ATAAGACAGAGCACAAGTTTGATCTTGTGATATCTGATATGTATACAGACAATGATTCAA 


AAAGAAAGCATGAAGGCGTGATAGCCAATAATGGCAATGATGACGTTTTCATATATCTCT 


CAAGTTTTCTTCGTAATAATTTGGCTCTAGGTGGTAGTTTTGCTGTAAAAGTGACAGAGA 


CAAGTTGGCACGAAGTTTTATATGACATTGCACAGGAT TGTGCATGGTGGACAATGTTTT 


GTACAGCAGTGAATGCCTCTTCTTCAGAAGCATTCTTGGTTGGTGTTAATTATTTGGGTG 


CAAGTGAAAAGGTTAAGGT TAGTGGAAAAACGCTGCACGCAAATTATATATTTTGGAGGA 


ATTGTAATTATTTACAAACCTCTGCTTATAGTATATTTGACGTTGCTAAGTTTGATTTGA 


GATTGAAAGCAACACCAGT TGTTAATT TGAAAAC TGAACAAAAGAGAGACTTAGTGTTTA 


ATTTAATTAAGTGTGGTAAGT TACTGGTAAGAGATGTTGGTAACACCTCTTTTACTAGTG 


TACCAAAGTGCCTT TAGACCACCTAATGGT TGGCATTTACACGGGGGTGCTTATGCGGTA 


GTTAATATTTCTAGCGAAT CTAATAATGCAGGCTCTTCACCTGGGTGTATTGTTGGTACT 


ATTCATGGTGGTCGTGTTGTTAATGCTTCTTCTATAGCTATGACGGCACCGTCATCAGGT 


ATGGCTTGGTCTAGCAGTCAGTTTTGTACTGCACACTGTAACTTTTCAGATACTACAGTG 


TTTGTTACACATTGTTATAAATATGATGGGTGTCCTATAACTGGCATGCTTCAAAAGAAT 


TTTTTACGTGTTTCTGCTATGAAAAATGGCCAGCTTTTCTATAATTTAACAGTTAGTGTA 


GCTAAGTACCCTACTTTTAAATCATTTCAGTGTGTTAATAATTTAACATCCGTATATTTA 


AATGGTGATCTTGTTTACACCTCTAATGAGACCACAGATGTTACATCTGCAGGTGTTTAT 
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TTTAAAGCTGGTGGACCTATAACTTATAAAGT TATGAGAGAAGTTAAAGCCCTGGCTTAT 


TTTGTTAATGGTACTGCACAAGATGTTATTTTGTGTGATGGATCACCTAGAGGCTTGTTA 
GCATGCCAGTATAATACTGGCAATTTTTCAGATGGCTTTTATCCTTTTATTAATAGTAGT 
TTAGTTAAGCAGAAGTT TATTGTCTAT CGTGAAAATAGTGTTAATACTACTTTTACGTTA 
CACAATTTCACTTTTCATAATGAGACTGGCGCCAACCCTAATCCTAGTGGTGTTCAGAAT 


ATTCAAACTTACCAAACACAAACAGCTCAGAGTGGTTATTATAATTTTAATTTTTCCTTT 


CTGAGTAGTTTTGTTTATAAGGAGTCTAATTTTATGTATGGATCTTATCACCCAAGTTGT 


AATTTTAGACTAGAAACTATTAATAATGGCTTGTGGTT TAATTCACTTTCAGTTTCAATT 


GCTTACGGTCCTCTTCAAGGTGGTTGCAAGCAATCTGTCTTTAGTGGTAGAGCAACTTGT 


TGTTATGCTTATTCATATGGAGGTCCTTCGCTGTGTAAAGGTGTTTATTCAGGTGAGTTA 


GATCTTAATTTTGAATGTGGACTGTTAGTTTATGTTACTAAGAGCGGTGGCTCTCGTATA 


CAAACAGCCACTGAACCGCCAGT TATAACT CGACACAATTATAATAATATTACTTTAAAT 


ACTTGTGTTGAT TATAATATATATGGCAGAACTGGCCAAGGTTTTATTACTAATGTAACC 


GACTCAGCTGTTAGTTATAAT TATCTAGCAGACGCAGGTTTGGCTATTTTAGATACATCT 


GGTTCCATAGACATCTTTGTTGTACAAGGTGAATATGGTCTTACTTATTATTAGGTTAAC 


CCTTGCGAAGATGTCAACCAGCAGTTTGTAGTTTCTGGTGGTAAATTAGTAGGTATTCTT 


ACTTCACGTAATGAGACTGGTTCTCAGCTTCTTGAGAACCAGTTTTACATTAAAATCACT 


AATGGAACACGTCGTTTTAGACGTTCTATTACTGAAAATGTTGGAAATTGCCCTTATGTT 


AGTTATGGTAAGTTTTGTATAAAACCTGATGGTT CAAT TGCCACAATAGTACCAAAACAA 


TTGGAACAGTTTGTGGCACCTTTACTTAATGT TACTGAAAATGTGCTCATACCTAACAGT 


TTTAATTTAACTGT TACAGATGAGTACATACAAACGCGTATGGATAAGGTCCAAATTAAT 


TGTCTGCAGTATGTTTGTGGCAATTCTCTGGATTGTAGAGATTTGTTTCAACAATATGGG 


CCTGTTTGTGACAACATATTGTCTGTAGTAAATAGTAT TGGTCAAAAAGAAGATATGGAA 


CTTTTGAATTTCTATTCTTCTACTAAACCGGCTGGTTTTAATACACCATTTCTTAGTAAT 


GTTAGCACTGGTGAGTTTAATATTTCTCTTCTGTTAACAACTCCTAGTAGTCCTAGAAGG 


CGTTCTTTTATTGAAGACCTTCTATTTACAAGCGTTGAATCTGTTGGATTACCAACAGAT 


GACGCATACAAAAATTGCACTGCAGGACCTTTAGGTTTTCTTAAGGACCTTGCGTGTGCT 


CGTGAATATAATGGTTTGCTTGTGTTGCCTCCCATTATAACAGCAGAAATGCAAATTTTG 


TATACTAGTTCTCTAGTAGCTTCTATGGCTTTTGGTGGTATTACTGCAGCTGGTGCTATA 


CCTTTTGCCACACAACTGCAGGCTAGAATTAATCACTTGGGTATTACCCAGTCACTTTTG 


TTGAAGAATCAAGAAAAAATTGCTGCTTCCTTTAATAAGGCCATTGGTCGTATGCAGGAA 


GGTTTTAGAAGTACATCTCTAGCAT TACAACAAATT CAAGATGTTGT TAATAAGCAGAGT 


GCTATTCTTACTGAGACTATGGCAT CACTTAATAAAAATTTTGGTGCTATTTCTTCTATG 


ATTCAAGAAATCTACCAGCAACT TGACGCCATACAAGCAAATGCTCAAGTGGATCGTCTT 


ATAACTGGTAGATTGTCATCACTTTCTGTTTTAGCATCTGCTAAGCAGGCGGAGCATATT 


AGAGTGTCACAACAGCGTGAGTTAGCTACT CAGAAAAT TAATGAGTGTGTTAAGTCACAG 


TCTATTAGGTACTCCTTTTGTGGTAATGGACGACATGTTCTAACCATACCGCAAAATGCA 


CCTAATGGTATAGTGTTTATACACTTTTCTTATACTCCAGATAGTTTTGTTAATGTTACT 


GCAATAGTGGGTTTTTGTGTAAAGCCAGCTAATGCTAGTCAGTATGCAATAGTACCCGCT 


AATGGTAGGGGTATTTT TATACAAGTTAATGGTAGT TACTACATCACAGCACGAGATATG 


TATATGCCAAGAGCTAT TACTGCAGGAGATATAGTTACGCTTACTTCTTGTCAAGCAAAT 
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TATGTAAGTGTAAATAAGACCGT CATTACTACAT TCGTAGACAATGATGATTTTGATTTT 
AATGACGAAT TGTCAAAATGGTGGAATGACACTAAGCATGAGCTACCAGACTTTGACAAA 
TTCAAT TACACAGTACCTATACT TGACATTGATAGTGAAATTGATCGTATTCAAGGCGTT 
ATACAGGGTCTTAATGACTCTTTAATAGACCTTGAAAAACTTTCAATACTCAAAACTTAT 
ATTAAGTGGCCTTGGTATGTGTGGT TAGCCATAGCTTTTGCCACTATTATCTTCATCTTA 
ATACTAGGATGGGTTTTCTTCATGACTGGATGTTGTGGTTGTTGTTGTGGATGCTTTGGC 
ATTATGCCTCTAATGAGTAAGTGTGGTAAGAAATCTTCTTATTACACGACTTTTGATAAC 
GATGTGGTAACT TAACAATACAGACCTAAAAAGTCTGTTTAATGATTCAAAGTCCCACGT 
CCTTCCTAATAGTATTAATTTTTCTTTGGTGTAAACTTGTACTAAGTTGTTTTAGAGAGT 
TTATTATAGCGCTCCAACAACTAATACAAGTTTTACTCCAAAT TATCAATAGTAACTTAC 
AGCCTAGACTGACCCTTTGTCACAGTCTAGACTAATGT TAAACTTAGAAGCAATTATTGA 
AACTGGTGAGCAAGTGATTCAAAAAAT CAGTTTCAATT TACAGCATATTTCAAGTGTATT 
AAACACAGAAGTATTTGACCCCTTTGACTATTGTTATTACAGAGGAGGTAATTTTTGGGA 
AATAGAGTCAGCTGAAGATTGTTCAGGTGATGATGAATTTATTGAATAAGTCGCTAGAGG 
AAAATGGAAGTTTTCTAACAGCGCTTTATATATTTGTAGGATTTTTAGCACTTTATCTTC 
TAGGTAGAGCACTTCAAGCATTTGTACAGGCTGCTGATGCTTGTTGTTTATTTTGGTATA 
CATGGGTAGTAATT CCAGGAGCTAAGGGTACAGCCTTTGTATATAAGTATACATATGGTA 
GAAAACTTAACAATCGGGAAT TAGAAGCAGTTATTGTCAACGAGTTTCCTAAGAACGGTT 
GGAATAATAAAAATCCAGCAAAT TT TCAAGATGTCCAACGAGACAAATTGTACTCTTGAC 
TTTGAACAGTCAGTTGAGCTTTTTAAAGAGTATAATTTATTTATAACTGCATTCTTGTTG 
TTCTTAACCATAATACTTCAGTATGGCTATGCAACAAGAAGTAAGTTTATTTATATACTG 
AAAATGATAGTGTTATGGTGCTTTTGGCCCCTTAACAT TGCAGTAGGTGTAATTTCATGT 
ATATACCCACCAAACACAGGAGGTCTTGTCGCAGCGATAATACTTACAGTGTTTGCGTGT 
CTGTCTTTTGTAGGTTATTGGAT CCAGAGTAT TAGACTCTTTAAGCGGTGTAGGTCATGG 
TGGTCATTTAACCCAGAATCTAATGCCGTAGGTT CAATACTCCTAACTAATGGTCAACAA 
TGTAATTTTGCTATAGAGAGTGTGCCAATGGTGCTTTCTCCAATTATAAAGAATGGTGTT 
CTTTATTGTGAGGGTCAGTGGCT TGCTAAGTGTGAACCAGACCACTTGCCTAAAGATATA 
TTTGTTTGTACACCGGATAGACGTAATATCTACCGTATGGTGCAGAAATATACTGGTGAC 
CAAAGCGGAAATAAGAAACGGTT TGCTACGTTTGTCTATGCAAAGCAGTCAGTAGATACT 
GGCGAGCTAGAAAGTGTAGCAACAGGAGGGAGTAGT CT TTACACCTAAATGTGTGTGTGT 
AGAGAGTATTTAAAATTATTCTTTAATAGTGCCTCTATTTTAAGAGCGCATAATAGTATT 
ATTTTTGAGGATAT TAATATAAATCCTCTCTGTTTTATACTCTCTTTTCAAGAGCTATTA 
TTTAAAAAACAGTTTTTCCACTCTTTTGTGCCAAAAACTATTGTTGT TAATGGTGTAACC 
TTTCAAGTAGATAATGGAAAAGT CTACTACGAAGGAAAACCAATT TT TCAGAAAGGTTGT 
TGTAGGTTGTGGTTGAGTTATAAAAAAGAT TAAACTACCTACTACACTTATTTTTATAAG 
AGGCGTTTTATCTTACAAGCGCT TAATAAATACGGACGATGAAATGGCTGACTAGTTTTG 
TAAGGGCAGT TATTTCATGTTATAAACCCCTATTATTAACTCAATTAAGAGTATTAGATA 
GGTTAATCTTAGATCATGGACCAAAACACATCTTAACGTGTGTTAGGTGCGTGATTTTGT 


TTCAATTAGATT TAGTT TATAGGTTGGCGTATACGCCTACTCAATCGCTGGTATGAATAA 


TAGTAAAGATAATCCTT TT TGCGGAGCAATAGCAAGAAAAGCGCGAATTTATCTGAGAGA 
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-continued 
AGGATTAGATTGTGTTTACTTTCTTAACAAAGCAGGACAAGCAGAGTCTTGTCCCGCGTG 


TACCTCTCTAGTAT TCCAGGGGAAAACTTGTGAGGAACACAAATATAATAATAATCTTTT 
GTCATGGCAAGCGGTAAGGCAAC TGGAAAGACAGATGCCCCAGCTCCAGTCATCAAACTA 
GGAGGACCAAAGCCACCTAAAGT TGGTTCTTCTGGAAATGTATCTTGGTTTCAAGCAATA 
AAAGCCAAGAAGTTAAATTCACCTCCGCCTAAGT TTGAAGGTAGCGGTGTTCCTGATAAT 
GAAAAT CTAAAACCAAGTCAGCAGCATGGATATTGGAGACGCCAAGCTAGGTTTAAGCCA 
GGTAAAGGTGGAAGAAAACCAGTCCCAGATGCTTGGTATTTTTAGTATACTGGAACAGGA 
CCAGCCGCTAACCTGAATTGGGGTGATAGCCAAGATGGTATAGTGTGGGTTGCTGGTAAG 
GGTGCTGATACTAAATT TAGATCTAATCAGGGTACTCGTGACTCTGACAAGTTTGACCAA 
TATCCGCTACGGTT TTCAGACGGAGGACCTGATGGTAATTTCCGTTGGGATTTCATTCCT 
CTGAAT CGTGGCAGGAGTGGGAGAT CAACAGCAGCT TCATCAGCAGCATCTAGTAGAGCA 
CCATCACGTGAAGT TTCGCGTGGTCGCAGGAGTGGT TCTGAAGATGATCTTATTGCTCGT 
GCAGCAAGGATAAT TCAGGAT CAGCAGAAGAAGGGT TCTCGCATTACAAAGGCTAAGGCT 
GATGAAATGGCTCACCGCCGGTATTGCAAGCGCAGTATTCCACCTAATTATAAGGTTGAT 
CAAGTGTTTGGTCCCCGTACTAAAGGTAAGGAGGGAAATTTTGGTGATGACAAGATGAAT 
GAGGAAGGTATTAAGGATGGGCGCGTTACAGCAATGCTCAACCTAGTTCCTAGCAGCCAT 
GCTTGTCTTTTCGGAAGTAGAGTGACGCCCAGACTTCAACCAGATGGGCTGCACTTGAAA 
TTTGAATTTACTACTGTGGTCCCACGTGATGATCCGCAGTTTGATAATTATGTAAAAATT 
TGTGATCAGTGTGT TGATGGTGTAGGAACACGTCCAAAAGATGATGAACCAAGACCAAAG 
TCACGCTCAAGT TCAAGACCTGCAACAAGAGGAAAT TCTCCAGCGCCAAGACAGCAGCGC 
CCTAAGAAGGAGAAAAAGC CAAAGAAGCAGGATGATGAAGTGGATAAAGCATTGACCTCA 
GATGAGGAGAGGAACAATGCACAGCTGGAATT TGATGATGAACCCAAGGTAATTAACTGG 
GGGGAT TCAGCGCTAGGAGAGAATGAACTT TGAGTAAAAT TGAATAGTAAGAGTTAAGGA 
AGATAGGCATGTAGCTTGATTACCTACATGTCTATCGCCAGGGAAATGTCTAATTTGTCT 
ACTTAGTAGCCTGGAAACGAACGGTAGACCCTTAGATTTTAATTTAGTTTAATTTTTAGT 
TTAGTTTAAGTTAGTTTAGAGTAGGTATAAAGATGCCAGTGGCGGGGCCACGCGGAGTAC 
GACCGAGGGTACAGCACTAGGACGCCCATTAGGGGAAGAGCTAAATTTTAGTTTAAGTTA 
AGTTTAATTGGCTATGTATAGTTAAAATTTATAGGCTAGTATAGAGT TAGAGCAAAAAAA 


AAAAAAAAAAAAAAAAAAAA 


Replicase 

In addition to the structural and accessory genes, two- 5° 
thirds of a coronavirus genome comprises the replicase gene 
(at the 5' end of the genome), which is expressed as two 
polyproteins, ppla and pplab, in which pplab is an exten- 
sion product of ppla as a result of a -1 ribosomal shift 
mechanism. The two polyproteins are cleaved by two types 55 
of virus-encoded proteinases usually resulting in 16 non- 
structural proteins (Nsp1l-16); IBV lacks Nsp1 thereby 
encoding Nsp2-16. 

Thus Gene 1 in IBV encodes 15 (16 in other coronavi- 
ruses) non-structural proteins (nsp2-16), which are associ- 60 
ated with RNA replication and transcription. 

The term ‘replicase protein’ is used herein to refer to the 
ppla and pplab polyproteins or individual nsp subunits. 

The term ‘replicase gene’ is used herein to refer to a 
nucleic acid sequence which encodes for replicase proteins. 65 
Asummary of the functions of coronavirus nsp proteins is 

provided in Table 1. 


Nsp 


Protein 


1 
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TABLE 1 


Key features 


Conserved within but not between coronavirus genetic 
groups; potential regulatory functions in the host cell. 
Dispensable for MHV and SARS-CoV replication in 
tissue culture 

Acidic domain; macro domain with ADRP and poly 
(ADP-ribose)-binding activities; one or two ZBD- 
containing papain-like proteases; Y domain 
Transmembrane domain 

3C-like main protease, homodimer 

Transmembrane domain 

Interacts with nsp8 to form a hexadecamer complex 
Noncannonical RNA polymerase; interacts with nsp7 to 
form a hexadecameric complex 

ssRNA-binding protein, dimer 

RNA-binding protein, homododecamer, zinc-binding 
domain, known to interact with nsp14 and nsp16 
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Nsp 
Protein Key features 


32 


Nsp-14 comprises a 3'-to-5' exoribonuclease (ExoN) 
active domain in the amino-terminal region. SARS-CoV 
ExoN has been demonstrated to have metal ion-dependent 
3'-to-5' exoribonuclease activity that acts on both single- 


11 Unknown 5 stranded and double-stranded RNA, but not on DNA. Nsp- 
12 RNA-dependent RNA polymerase z : oa : 
is Zinc-binding domain, NTPase, dNTPase, 5-to-3' RNA 14 has been shown to have proof-reading activity. This nsp 
and DNA helicase, RNA 5'-triphosphate has also been shown to have N7-methyltransferase (MT) 
14 3'-to 5' exoribonuclease, zinc-binding domain and N7- activity in the carboxyl-terminal region. 
methyltransferase Nsp-15 associated NendoU (nidoviral endoribonuclease, 
15 Uridylate-specific endoribonuclease, homohexamer 10 . ee 
16 Putative ribose-2'-O-methyltransferase specific for U) RNase activity has been reported for a 
number of coronaviruses, including SARS-CoV, MHV and 
The variant replicase gene encoded by the coronavirus of a davexiere bana lass . be Sigma 
the present invention comprises a mutation in one or more ant ly enue Dy oe 1008 be ete Was ne Cray a 
of the sections of sequence encoding nsp-10, nsp-14, nsp-15 15 the Presence, of Mg and Ca , NendoU cleaves at the 3 
or nsp-16. side of uridylate residues in both single-stranded and 
Nsp10 has RNA-binding activity and appears to be double-stranded RNA. The biologically relevant substrate(s) 
involved in homo and/or heterotypic interactions within of coronavirus NendoUs remains to be identified. 
other nsps from the ppla/pp1ab region. It adopts an a/B fold Nsp-16 has been predicted to mediate ribose-2'-O-meth- 
comprised of five a-helices, one 3,,-helix and three 5, yltransferase (2'-O-MTase) activity and reverse-genetics 
B-strands. Two zinc-binding sites have been identified that experiments have shown that the 2'-O-MTase domain is 
are formed by conserved cysteine residues and one histidine essential for viral RNA synthesis in HCoV-229E and SARS- 
residue (Cys-74/Cys-77/His-83/Cys-90; Cys-117/Cys-120/ CoV. The enzyme may be involved in the production of the 
Cys-128/Cys-130). The protein has been confirmed to bind cap 1 structures of coronavirus RNAs and it may also 
single-stranded and double-stranded RNA and DNA without 5; cooperate with NendoU and ExoN in other RNA processing 


obvious specificity. Nsp-10 can be cross-linked with nsp-9, 
suggesting the existing of a complex network of protein- 
protein interactions involving nsp-7, -8, -9 and -10. In 
addition, nsp-10 is known to interact with nsp-14 and 
nsp-16. 


pathways. 2'-O-MTase might also methylate specific RNAs 
to protect them from NendoU-mediated cleavage. 

The genomic and protein sequences for nsp-10, -14, -15 
and -16 are provided as SEQ ID NO: 2-5 and 6-9, respec- 
tively. 


(nsp-10 nucleotide sequence- nucleotides 11884-12318 of SEQ ID NO: 


1) 


SEQ ID NO: 


TCTAAAGGT CATGAGACAGAGGAAGTGGATGCTGTAGGCATTCTCTCACTTTGTTCTTTTGCAGTA 
GATCCTGCGGATACATATTGTAAATATGTGGCAGCAGGTAATCAACCTTTAGGTAACTGTGTTAAA 
ATGTTGACAGTACATAATGGTAGTGGTT TTGCAATAACATCAAAGCCAAGTCCAACTCCGGATCAG 
GATTCTTATGGAGGAGCTTCTGTGTGTCTTTATTGTAGAGCACATATAGCACACCTTGGCGGAGCA 
GGAAATT TAGATGGACGCTGTCAATT TAAAGGTTCTTTTGTGCAAATACCTACTACGGAGAAAGAT 
CCTGTTGGATTCTGTCTACGTAACAAGGTTTGCACTGTTTGTCAGTGTTGGATTGGTTATGGATGT 
CAGTGTGATTCACTTAGACAACCTAAACCTTCTGTTCAG 


(nsp-14 nucleotide sequence- nucleotides 16938-18500 of SEQ ID NO: 


1) 


SEQ ID NO: 


GGTACAGGCTTGTTTAAAAT TTGCAACAAAGAGTT TAGTGGTGTTCACCCAGCTTATGCAGTCACA 


ACTAAGGCTCTTGCTGCAACTTATAAAGTTAATGATGAACTTGCTGCACTTGTTAACGTGGAAGCT 


GGTTCAGAAATAACATATAAACATCTTATTTCTTTGTTAGGGTTTAAGATGAGTGTTAATGTTGAA 


GGCTGCCACAACATGTT TATAACACGTGATGAGGCTATCCGCAACGTAAGAGGT TGGGTAGGTTTT 


GATGTAGAAGCAACACATGCTTGCGGTACTAACATTGGTACTAACCTGCCTTTCCAAGTAGGTTTC 


TCTACTGGTGCAGACTTTGTAGTTACGCCTGAGGGACTTGTAGATACTTCAATAGGCAATAATTTT 


GAGCCTGTGAATTCTAAAGCACCT CCAGGTGAACAAT TTAATCACT TGAGAGCGTTATTCAAAAGT 


GCTAAACCTTGGCATGT TGTAAGGCCAAGGATTGTGCAAATGT TAGCGGATAACCTGTGCAACGTT 


TCAGATTGTGTAGTGTTTGTCACGTGGTGTCATGGCCTAGAACTAACCACTTTGCGCTATTTTGTT 


AAAATAGGCAAGGACCAAGTTTGTTCTTGCGGT TCTAGAGCAACAACTTTTAATTCTCATACTCAG 


GCTTATGCTTGTTGGAAGCATTGCTTGGGTTTTGATTTTGTTTATAATCCACTCTTAGTGGATATT 


CAACAGTGGGGTTATTCTGGTAACCTACAATTTAACCATGATTTGCATTGTAATGTGCATGGACAC 


GCACATGTAGCTTCTGCGGATGCTAT TATGACGCGTTGTCTTGCAATTAATAATGCATTTTGTCAA 


2 
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-continued 
GATGTCAACTGGGATTTAACTTACCCTCATATAGCAAATGAGGATGAAGTCAATTCTAGCTGTAGA 
TATTTACAACGCATGTATCTTAATGCATGTGTTGATGCTCTTAAAGT TAACGTTGTCTATGATATA 
GGCAACCCTAAAGGTAT TAAATGTGTTAGACGTGGAGACT TAAATTT TAGATTCTATGATAAGAAT 
CCAATAGTACCCAATGT CAAGCAGT TTGAGTATGAC TATAATCAGCACAAAGATAAGTTTGCTGAT 
GGTCTTTGTATGTTTTGGAAT TGTAATGTGGATTGTTATCCCGACAATTCCTTACTTTGTAGGTAC 
GACACACGAAAT TTGAGTGTGTT TAACCTACCTGGT TGTAATGGTGGTAGCTTGTATGTTAACAAG 
CATGCATTCCACACACCTAAATT TGATCGCACTAGCTT TCGTAATTTGAAAGCTATGCCATTCTTT 
TTCTATGACTCATCGCCTTGCGAGACCATT CAAT TGGATGGAGTTGCGCAAGACCTTGTGTCATTA 
GCTACGAAAGAT TGTAT CACAAAATGCAACATAGGCGGTGCTGTT TGTAAAAAGCACGCACAAATG 
TATGCAGATTTTGTGACTTCTTATAATGCAGCTGTTACTGCTGGTTTTACTTTTTGGGTTACTAAT 
AATTTTAACCCATATAATTTGTGGAAAAGTTTTTCAGCTCTCCAG 
(nsp-15 nucleotide sequence- nucleotides 18501-19514 of SEQ ID NO: 1) 

SEQ ID NO: 4 
TCTATCGACAATATTGCTTATAATATGTATAAGGGTGGTCATTATGATGCTATTGCAGGAGAAATG 
CCCACTATCGTAACTGGAGATAAAGTTTTTGTTATAGATCAAGGCGTAGAAAAAGCAGTTTTTTTT 
AATCAAACAATTCTGCCTACATCTGTAGCGTT TGAGCTGTATGCGAAGAGAAATATTCGCACACTG 
CCAAACAACCGTATTTTGAAAGGTT TGGGTGTAGATGTGACTAATGGATTTGTAATTTGGGATTAC 
ACGAACCAAACACCACTATACCGTAATACTGT TAAGGTATGTGCATATACAGACATAGAACCAAAT 
GGCCTAATAGTGCTGTATGATGATAGATATGGTGAT TACCAGTCTTTTCTAGCTGCTGATAATGCT 
GTTTTAGTTTCTACACAGTGT TACAAGCGGTATTCGTATGTAGAAATACCGTCAAACCTGCTTGTT 
CAGAACGGTATTCCGTTAAAAGATGGAGCGAACCTGTATGTTTATAAGCGTGTTAATGGTGCGTTT 
GTTACGCTACCTAACACAATAAACACACAGGGTCGAAGTTATGAAACTTTTGAACCTCGTAGTGAT 
GTTGAGCGTGATTTTCTCGACATGT CTGAGGAGAGT TT TGTAGAAAAGTATGGTAAAGAATTAGGT 
CTACAGCACATACTGTATGGTGAAGTTGATAAGCCCCAATTAGGTGGTTTCCACACTGTTATAGGT 
ATGTGCAGACTTTTACGTGCGAATAAGTTGAACGCAAAGTCTGTTACTAATTCTGATTCTGATGTC 
ATGCAAAATTATTTTGTATTGGCAGACAATGGTT CCTACAAGCAAGTGTGTACTGTTGTGGATTTG 
CTGCTTGATGATTTCTTAGAACTTCTTAGGAACATACTGAAAGAGTATGGTACTAATAAGTCTAAA 
GTTGTAACAGTGTCAATTGAT TACCATAGCATAAATTTTATGACTTGGTTTGAAGATGGCATTATT 
AAAACATGTTATCCACAGCTTCAA 
(nsp-16 nucleotide sequence- nucleotides 19515-20423 of SEQ ID NO: 1) 

SEQ ID NO: 5 
TCAGCATGGACGTGTGGTTATAATATGCCTGAACTT TATAAAGTT CAGAATTGTGTTATGGAACCT 
TGCAACATTCCTAATTATGGTGT TGGAATAGCGT TGCCAAGTGGTAT TATGATGAATGTGGCAAAG 
TATACACAACTCTGTCAATACCT TT CGAAAACAACAATGTGTGTACCGCATAATATGCGAGTAATG 
CATT TTGGAGCTGGAAGTGACAAAGGAGTGGTGCCAGGTAGTACTGTTCTTAAACAATGGCTCCCA 
GAAGGGACACTCCTTGTCGATAATGATATTGTAGACTATGTGTCTGATGCACATGTTTCTGTGCTT 
TCAGAT TGCAATAAATATAAGACAGAGCACAAGTTTGATCTTIGTGATATCTGATATGTATACAGAC 
AATGAT TCAAAAAGAAAGCATGAAGGCGTGATAGCCAATAATGGCAATGATGACGTTTTCATATAT 
CTCTCAAGTTTTCTTCGTAATAATTTGGCT CTAGGTGGTAGTTTTGCTGTAAAAGTGACAGAGACA 


AGTTGGCACGAAGT TTTATATGACATTGCACAGGAT TGTGCATGGTGGACAATGTTTTGTACAGCA 


GTGAATGCCTCTTCTTCAGAAGCATTCTTGAT TGGTGT TAATTATTTGGGTGCAAGTGAAAAGGTT 


AAGGTTAGTGGAAAAACGCTGCACGCAAAT TATATATT TTGGAGGAATTGTAATTATT TACAAACC 
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-continued 
TCTGCTTATAGTATATTTGACGTTGCTAAGTTTGATTTGAGATTGAAAGCAACGCCAGTTGTTAAT 


TTGAAAACTGAACAAAAGACAGACTTAGTCTT TAATTTAATTAAGTGTGGTAAGTTACTGGTAAGA 
GATGTTGGTAACACCTCTTTTACTAGTGACTCTTTTGTGTGTACTATGTAG 


(nsp-10 amino acid sequence) 


SEQ ID NO: 


SKGHETEEVDAVGI LSLCSFAVDPADTYCKYVAAGNQPLGNCVKMLTVKNGSGFAI TSKPSPTPDQ 
DSYGGASVCLYCRAHIAHPGGAGNLDGRCQFKGSFVQI PTTEKDPVGFCLRNKVCTVCQCWIGYGC 
QCDSLRQPKPSVQ 


(nsp-14 amino acid sequence) 


SEQ ID NO: 


GTGLFKICNKEFSGVHPAYAVTTKALAATYKVNDELAALVNVEAGSEITYKHLISLLGFKMSVNVE 
GCHNMF I TRDEAIRNVRGWVGFDVEATHACGTNIGTNLPFQVGFSTGADFVVTPEGLVDTSIGNNF 
EPVNSKAPPGEQFNHLRALFKSAKPWHVVRPRIVQMLADNLCNVSDCVVFVTWCHGLELTTLRYFV 
KIGKDOVCSCGSRATTFNSHTQAYACWKHCLGFDFVYNPLLVD IQQWGY SGNLOFNHDLHCNVHGH 
AHVASADAIMTRCLAINNAFCQDVNWDLTY PHIANEDEVNSSCRYLQORMYLNACVDALKVNVVYDI 
GNPKGI KCVRRGDLNFRFYDKNP IVPNVKQFEYD YNQHKDKFADGLCMFWNCNVDCYPDNSLVCRY 
DTRNLSVFNLPGCNGGS LYVNKHAFHT PKFDRTSFRNLKAMPFFFYDSSPCETIQLDGVAQDLVSL 
ATKDCITKCNICGAVCKKKAQMYADFVTSYNAAV TAGF TFWVTNNFNPYNLWKSFSALOQ 


(nsp-15 amino acid sequence) 


SEQ ID NO: 8 


SIDNIAYNMYKGGHYDAIAGEMPTIVTGDKVFVIDQGVEKAVFFNOQTILPTSVAFELYAKRNIRTL 
PNNRILKGLGVDVTNGFVIWDYTNQTPLYRNTVKVCAYTDIEPNGLIVLYDDRYGDYQSFLAADNA 
VLVSTQCYKRYS YVEIPSNLLVQNGI PLKDGANLYVYKRVNGAFVTLPNTLNTQGRSYETFEPRSD 
VERDFLDMSEES FVEKYGKELGLOQHILYGEVDKPQLGGLHTVIGMCRLLRANKLNAKSVTNSDSDV 
MQNYFVLADNGS YKQVCTVVDLLLDDFLELLRNI LKEYGTNKSKVVTVSIDYHSINFMTWFEDGII 
KTCYPQLQ 


(nsp-16 amino acid sequence) 


SEQ ID NO: 9 


SAWTCGYNMPELYKVQNCVMEPCNI PNYGVGIALPSGIMMNVAKYTQLCQYLSKTTMCVPHNMRVM 


HFGAGSDKGVAPGS TVLKQWLPEGTLLVDNDIVDYVSDAHVSVLSDCNKYKTEHKFDLVISDMYTD 


NDSKRKHEGV IANNGNDDVFI YLSSFLRNNLALGGS FAVKVTETSWHEVLYDIAQDCAWWTMFCTA 


VNASSSEAFLVGVNYLGAS EKVIWSGKTLHANYI FWRNCNYLOTSAYSIFDVAKFDLRLKATPVVN 
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LKTEQKTDLVFNLIKCGKLLVRDVGNTSFTSDSFVCTM 


Reduced Pathogenicity 


The live, attenuated coronavirus of the present invention 
comprises a variant replicase gene which causes the virus to 
have reduced pathogenicity compared to a coronavirus 
expressing the corresponding wild-type gene. 


The term “attenuated” as used herein, refers to a virus that 
exhibits said reduced pathogenicity and may be classified as 
non-virulent. A live, attenuated virus is a weakened repli- 
cating virus still capable of stimulating an immune response 
and producing immunity but not causing the actual illness. 


The term “pathogenicity” is used herein according to its 
normal meaning to refer to the potential of the virus to cause 
disease in a subject. Typically the pathogenicity of a coro- 
navirus is determined by assaying disease associated symp- 
toms, for example sneezing, snicking and reduction in 
tracheal ciliary activity. 


50 


55 


60 
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The term “reduced pathogenicity” is used to describe that 
the level of pathogenicity of a coronavirus is decreased, 
lessened or diminished compared to a corresponding, wild- 
type coronavirus. 

In one embodiment, the coronavirus of the present inven- 
tion has a reduced pathogenicity compared to the parental 
M41-CK virus from which it was derived or a control 
coronavirus. The control coronavirus may be a coronavirus 
with a known pathogenicity, for example a coronavirus 
expressing the wild-type replicase protein. 

The pathogenicity of a coronavirus may be assessed 
utilising methods well-known in the art. Typically, patho- 
genicity is assessed by assaying clinical symptoms in a 
subject challenged with the virus, for example a chicken. 

As an illustration, the chicken may be challenged at 8-24 
days old by nasal or ocular inoculation. Clinical symptoms, 
associated with IBV infection, may be assessed 3-10 days 
post-infection. Clinical symptoms commonly assessed to 
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determine the pathogenicity of a coronavirus, for example an 
IBV, include gasping, coughing, sneezing, snicking, depres- 
sion, ruffled feathers and loss of tracheal ciliary activity. 

The variant replicase of the present invention, when 
expressed in a coronavirus, may cause a reduced level of 5 
clinical symptoms compared to a coronavirus expressing a 
wild-type replicase. 

For example a coronavirus expressing the variant repli- 
case may cause a number of snicks per bird per minute 
which is less than 90%, less than 80%, less than 70%, less 
than 60%, less than 50%, less than 40%, less than 30%, less 
than 20% or less than 10% of the number of snicks caused 
by a virus expressing the wild type replicase. 

A coronavirus expressing a variant replicase according to 
the present invention may cause wheezing in less than 70%, 
less than 60%, less than 50%, less than 40%, less than 30%, 
less than 20% or less than 10% of the number of birds in a 
flock infected with the a virus expressing the wild type 
replicase. 

A coronavirus expressing a variant replicase according to 
the present invention may result in tracheal ciliary activity 
which is at least 60%, at least 70%, at least 80%, at least 90% 
or at least 95% of the level of tracheal ciliary activity in 
uninfected birds. 

A coronavirus expressing a variant replicase according to 
the present invention may cause clinical symptoms, as 
defined in Table 2, at a lower level than a coronavirus 
expressing the wild type replicase. 


TABLE 2 
IBV severity limits based on clinical signs: 


Snicking (sneezing) 

Nasal exudate 

Watery eyes 

Swollen infraorbital sinuses up to 7d). 
Rales (vibration in trachea or bronchi region) 


Hunched posture/depressed 
Fluffed up feathers 


derat 
Eating and drinking less moderate 


Drinking in excess: evident by fluid filled crop or 


measured water intake 


Less active but still evade capture 
Weight loss 

Not eating or drinking node: 
Birds sit alone and does not evade capture 

Severe respiratory distress: e.g. excessive gasping method. 
Snicking and/or rales for 7d in total 


Mild, if exceed 2d increase to 


Mild, if exceed 1d increase to 
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levels when administered to broiler chickens with maternal 
antibodies as these strains are neutralized by the maternal 
antibody pool. 

Thus a viral particle must be sufficiently efficient at 
replicating and propagating to ensure that it is not neutral- 
ized by the maternally-derived antibodies against the virus. 
Maternally-derived antibodies are a finite pool of effective 
antibodies, which decrease as the chicken ages, and neu- 
tralization of the virus in this manner does not equate to the 
establishment of long-term immunity for the embryo/chick. 
In order to develop long-term immunity against the virus, 
the embryo and hatched chicken must develop an appropri- 
ate protective immune response which is distinct to the effect 
of the maternally-derived antibodies. 

To be useful for in ovo vaccination, the virus must also not 
replicate and propagate at a level which causes it to be 
pathogenic to the embryo. 

Reduced pathogenicity in terms of the embryo may mean 
that the coronavirus causes less reduction in hatchability 
compared to a corresponding, wild-type control coronavirus. 
Thus the term “without being pathogenic to the embryo” in 
the context of the present invention may mean “without 
causing reduced hatchability” when compared to a control 
coronavirus. 

A suitable variant replicase may be identified using meth- 
ods which are known in the art. For example comparative 
challenge experiments following in ovo vaccination of 


IBV specific: Mild (N.B. Respiratory signs 
become apparent from 2-3 dpi if they 
are going to occur and can continue for 


IBV specific: Mild, if exceed 24h increase to 
moderate for a max of 2d. If still drinking in excess 
then kill by schedule 1 method. 


Moderate: birds at end point. Kill by schedule 1 


Severe: report to project license 


Found dead holder. 


Full post-mortem to be performed. 


The variant replicase of the present invention, when 
expressed in a coronavirus, may cause the virus to replicate 


at non-pathogenic levels in ovo. 60 


While developing vaccines to be administered in ovo to 
chicken embryos, attention must be paid to two points: the 
effect of maternal antibodies on the vaccines and the effect 
of the vaccines on the embryo. Maternal antibodies are 
known to interfere with active immunization. For example, 
vaccines with mild strains do not induce protective antibody 


65 


embryos with or without maternally-derived antibodies may 
be performed (i.e. wherein the layer has or has not been 
vaccinated against IBV). 

If the variant replicase enables the virus to propagate at a 
level which is too high, the embryo will not hatch or will not 
be viable following hatching (i.e. the virus is pathogenic to 
the embryo). A virus which is pathogenic to the embryo may 
kill the embryo. 

If the variant replicase causes a reduction in viral repli- 
cation and propagation which is too great, the virus will be 
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neutralised by the maternally-derived antibodies. Subse- 
quent challenge of the chick with IBV will therefore result 
in the development of clinical symptoms (for example 
wheezing, snicking, loss of ciliary activity) and the onset of 
disease in the challenged chick; as it will have failed to 
develop effective immunity against the virus. 

Variant 

As used herein, the term ‘variant’ is synonymous with 
‘mutant’ and refers to a nucleic acid or amino acid sequence 
which differs in comparison to the corresponding wild-type 
sequence. 

A variant/mutant sequence may arise naturally, or may be 
created artificially (for example by site-directed mutagen- 
esis). The mutant may have at least 70, 80, 90, 95, 98 or 99% 
sequence identity with the corresponding portion of the wild 
type sequence. The mutant may have less than 20, 10, 5, 4, 
3, 2 or 1 mutation(s) over the corresponding portion of the 
wild-type sequence. 

The term “wild type” is used to mean a gene or protein 
having a nucleotide or amino acid sequence which is iden- 
tical with the native gene or protein respectively (i.e. the 
viral gene or protein). 

Identity comparisons can be conducted by eye, or more 
usually, with the aid of readily available sequence compari- 
son programs. These commercially available computer pro- 
grams can calculate % identity between two or more 
sequences. A suitable computer program for carrying out 
such an alignment is the GCG Wisconsin Bestfit package 
(University of Wisconsin, U.S.A.; Devereux et al., 1984, 
Nucleic Acids Research 12:387). Examples of other soft- 
ware that can perform sequence comparisons include, but 
are not limited to, the BLAST package (see Ausubel et al., 
1999 ibid—Chapter 18), FASTA (Atschul et al., 1990, J. 
Mol. Biol., 403-410) and the GENEWORKS suite of com- 
parison tools, ClustalX (see Larkin et al. (2007) Clustal W 
and Clustal X version 2.0. Bioinformatics, 23:2947-2948). 
Both BLAST and FASTA are available for offline and online 
searching (see Ausubel et al., 1999 ibid, pages 7-58 to 7-60). 
However, for some applications, it is preferred to use the 
GCG Bestf it program. A new tool, called BLAST 2 
Sequences is also available for comparing protein and 
nucleotide sequence (see FEMS Microbiol Lett 1999 174(2): 
247-50; FEMS Microbiol Lett 1999 177(1): 187-8 and 
tatiana@ncbi.nlm.nih.gov). 

The sequence may have one or more deletions, insertions 
or substitutions of amino acid residues which produce a 
silent change and result in a functionally equivalent mol- 
ecule. Deliberate amino acid substitutions may be made on 
the basis of similarity in polarity, charge, solubility, hydro- 
phobicity, hydrophilicity, and/or the amphipathic nature of 
the residues as long as the activity is retained. For example, 
negatively charged amino acids include aspartic acid and 
glutamic acid; positively charged amino acids include lysine 
and arginine; and amino acids with uncharged polar head 
groups having similar hydrophilicity values include leucine, 
isoleucine, valine, glycine, alanine, asparagine, glutamine, 
serine, threonine, phenylalanine, and tyrosine. 

Conservative substitutions may be made, for example 
according to the Table below. Amino acids in the same block 
in the second column and preferably in the same line in the 
third column may be substituted for each other: 


ALIPHATIC Non-polar 


GA 
IL 
Polar- uncharged cs 
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Polar- charged DE 
KR 
AROMATIC HF WY 


The coronavirus of the present invention may comprise a 
variant replicase gene which encodes a protein which com- 
prises a mutation compared to any one of SEQ ID NO: 6, 7, 
8 or 9 which, when expressed in a coronavirus, causes the 
virus to have reduced pathogenicity compared to a corona- 
virus expressing the corresponding wild-type replicase. 

The variant replicase gene may encode a protein which 
comprises at least one or more amino acid mutations in any 
combination of nsp-10, nsp-14, nsp-15 and nsp-16. 

The variant replicase gene of the coronavirus of the 
present invention may encode a protein comprising a muta- 
tion as defined in the M41 mod sequences presented in FIG. 
10. 

The variant replicase gene of the coronavirus of the 
present invention may encode a protein which comprises 
one or more amino acid mutations selected from the list of: 

Pro to Leu at position 85 of SEQ ID NO: 6, 

Val to Leu at position 393 of SEQ ID NO: 7; 

Leu to Ile at position 183 of SEQ ID NO: 8; 

Val to Ile at position 209 of SEQ ID NO: 9. 

The variant replicase gene of the coronavirus of the 
present invention may encode a protein which does not 
comprise a mutation in nsp-2, nsp-3, nsp-6 or nsp-13. 

The variant replicase gene of the coronavirus of the 
present invention may encode a protein which does not 
comprise a mutation in nsp10 which corresponds to the 
threonine to isoleucine mutation caused by a mutation at 
nucleotide position 12,008 in the gene reported by Ammay- 
appan et al. (Arch Virol (2009) 154:495-499),. 

Ammayappan et al (as above) reports the identification of 
sequence changes responsible for the attenuation of IBV 
strain Arkansas DPI. The study identified 17 amino acid 
changes in a variety of IBV proteins following multiple 
passages, approx. 100, of the virus in embryonated eggs. It 
was not investigated whether the attenuated virus (Ark DPI 
101) is capable of replicating in the presence of maternally- 
derived antibodies against the virus in ovo, without being 
pathogenic to the embryo. Given that this virus was pro- 
duced by multiple passage in SPF embryonated eggs, similar 
methodology for classical IBV vaccines, it is likely that this 
virus is pathogenic for embryos. The virus may also be 
sensitive to maternally-derived antibodies if the hens were 
vaccinated with a similar serotype. 

The variant replicase gene of the coronavirus of the 
present invention may encode a protein which comprises 
any combination of one or more amino acid mutations 
provided in the list above. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutation Pro to Leu at position 85 
of SEQ ID NO: 6. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutation Val to Leu at position 
393 of SEQ ID NO: 7. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutation Leu to Ile at position 183 
of SEQ ID NO: 8. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutation Val to Ile at position 209 
of SEQ ID NO: 9. 
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The variant replicase gene may encode a protein which 
comprises the amino acid mutations Pro to Leu at position 
85 of SEQ ID NO: 6, and Val to Leu at position 393 of SEQ 
ID NO: 7. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutations Pro to Leu at position 
85 of SEQ ID NO: 6 Leu to Ile at position 183 of SEQ ID 
NO: 8. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutations Pro to Leu at position 
85 of SEQ ID NO: 6 and Val to Ie at position 209 of SEQ 
ID NO: 9. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutations Val to Leu at position 
393 of SEQ ID NO: 7 and Leu to Ile at position 183 of SEQ 
ID NO: 8. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutations Val to Leu at position 
393 of SEQ ID NO: 7 and Val to Ile at position 209 of SEQ 
ID NO: 9. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutations Leu to Ile at position 
183 of SEQ ID NO: 8 and Val to Ile at position 209 of SEQ 
ID NO: 9. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutations Pro to Leu at position 
85 of SEQ ID NO: 6, Val to Leu at position 393 of SEQ ID 
NO: 7 and Leu to Ile at position 183 of SEQ ID NO: 8. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutations Pro to Leu at position 
85 of SEQ ID NO: 6 Leu to Ile at position 183 of SEQ ID 
NO: 8 and Val to Ile at position 209 of SEQ ID NO: 9. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutations Pro to Leu at position 
85 of SEQ ID NO: 6, Val to Leu at position 393 of SEQ ID 
NO: 7 and Val to Ile at position 209 of SEQ ID NO: 9. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutations Val to Leu at position 
393 of SEQ ID NO: 7, Leu to Ile at position 183 of SEQ ID 
NO: 8 and Val to Ile at position 209 of SEQ ID NO: 9. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutations Pro to Leu at position 
85 of SEQ ID NO: 6, Val to Leu at position 393 of SEQ ID 
NO: 7, Leu to Ile at position 183 of SEQ ID NO: 8 and Val 
to Ile at position 209 of SEQ ID NO: 9. 

The variant replicase gene may also be defined at the 
nucleotide level. 

For example the nucleotide sequence of the variant rep- 
licase gene of the coronavirus of the present invention may 
comprise one or more nucleotide substitutions within the 
regions selected from the list of: 11884-12318, 16938- 
18500, 18501-19514 and 19515-20423 of SEQ ID NO:1. 

For example the nucleotide sequence of the variant rep- 
licase gene of the coronavirus of the present invention may 
comprise one or more nucleotide substitutions selected from 
the list of: 

C to Tat nucleotide position 12137; 

G to C at nucleotide position 18114; 

T to A at nucleotide position 19047; and 

G to A at nucleotide position 20139; 
compared to the sequence shown as SEQ ID NO: 1. 

As used herein, the term “substitution” is synonymous 
with the term mutation and means that the nucleotide at the 
identified position differs to that of the wild-type nucleotide 
sequence. 

The nucleotide sequence may comprise any combination 
of the nucleotide substitutions selected from the list of: 
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C to Tat nucleotide position 12137; 

G to Cat nucleotide position 18114; 

T to A at nucleotide position 19047; and 

G to A at nucleotide position 20139; 
compared to the sequence shown as SEQ ID NO: 1. 

The nucleotide sequence may comprise the substitution 
C12137T. 

The nucleotide sequence may comprise substitution 
G18114C. 

The nucleotide sequence may comprise the substitution 
T19047A. 

The nucleotide sequence may comprise the substitution 
G20139A. 

The nucleotide sequence may comprise the substitutions 
C12137T and G18114C. 

The nucleotide sequence may comprise the substitutions 
C12137T and T19047A. 

The nucleotide sequence may comprise the substitutions 
C12137T and G20139A. 

The nucleotide sequence may comprise the substitutions 
G18114C and T19047A. 

The nucleotide sequence may comprise the substitutions 
G18114C and G20139A. 

The nucleotide sequence may comprise the substitutions 
T19047A and G20139A. 

The nucleotide sequence may comprise the substitutions 
C12137T, G18114C and T19047A. 

The nucleotide sequence may comprise the substitutions 
C12137T, T19047A and G20139A. 

The nucleotide sequence may comprise the substitutions 
C12137T, G18114C and G20139A. 

The nucleotide sequence may comprise the substitutions 
G18114C, T19047A and G20139A. 

The nucleotide sequence may comprise the substitutions 
C12137T, G18114C, T19047A and G20139A. 

The nucleotide sequence may not comprise a substitution 
which corresponds to the C12008T substitution reported by 
Ammayappan et al. (as above). 

The nucleotide sequence may be natural, synthetic or 
recombinant. It may be double or single stranded, it may be 
DNA or RNA or combinations thereof. It may, for example, 
be cDNA, PCR product, genomic sequence or mRNA. 

The nucleotide sequence may be codon optimised for 
production in the host/host cell of choice. 

It may be isolated, or as part of a plasmid, virus or host 
cell. 

Plasmid 

A plasmid is an extra-chromosomal DNA molecule sepa- 
rate from the chromosomal DNA which is capable of 
replicating independently of the chromosomal DNA. They 
are usually circular and double-stranded. 

Plasmids, or vectors (as they are sometimes known), may 
be used to express a protein in a host cell. For example a 
bacterial host cell may be transfected with a plasmid capable 
of encoding a particular protein, in order to express that 
protein. The term also includes yeast artificial chromosomes 
and bacterial artificial chromosomes which are capable of 
accommodating longer portions of DNA. 

The plasmid of the present invention comprises a nucleo- 
tide sequence capable of encoding a defined region of the 
replicase protein. It may also comprise one or more addi- 
tional coronavirus nucleotide sequence(s), or nucleotide 
sequence(s) capable of encoding one or more other corona- 
virus proteins such as the S gene and/or gene 3. 

The plasmid may also comprise a resistance marker, such 
as the guanine xanthine phosphoribosyltransferase gene 
(gpt) from Escherichia coli, which confers resistance to 
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mycophenolic acid (MPA) in the presence of xanthine and 
hypoxanthine and is controlled by the vaccinia virus P7.5 
early/late promoter. 

Recombinant Vaccinia Virus 

The present invention also relates to a recombinant vac- 
cinia virus (rVV) comprising a variant replicase gene as 
defined herein. 

The recombinant vaccinia virus (rVV) may be made using 
a vaccinia-virus based reverse genetics system. 

In this respect, the present invention also provides a 
method for making a viral particle by: 

(i) transfecting a plasmid as described in the previous 

section into a host cell; 

(ii) infecting the host cell with a recombining virus 
comprising the genome of a coronavirus strain with a 
replicase gene; 

(iii) allowing homologous recombination to occur 
between the replicase gene sequences in the plasmid 
and the corresponding sequences in the recombining 
virus genome to produce a modified replicase gene; 

(iv) selecting for recombining virus comprising the modi- 
fied replicase gene. 

The term ‘modified replicase gene’ refers to a replicase 
gene which comprises a variant replicase gene as described 
in connection with the first aspect of the present invention. 
Specifically, the term refers to a gene which is derived from 
a wild-type replicase gene but comprises a nucleotide 
sequence which causes it to encode a variant replicase 
protein as defined herein. 

The recombination may involve all or part of the replicase 
gene. For example the recombination may involve a nucleo- 
tide sequence encoding for any combination of nsp-10, 
nsp-14, nsp-15 and/or nsp-16. The recombination may 
involve a nucleotide sequence which encodes for an amino 
acid mutation or comprises a nucleotide substitution as 
defined above. 

The genome of the coronavirus strain may lack the part of 
the replicase protein corresponding to the part provided by 
the plasmid, so that a modified protein is formed through 
insertion of the nucleotide sequence provided by the plas- 
mid. 

The recombining virus is one suitable to allow homolo- 
gous recombination between its genome and the plasmid. 
The vaccinia virus is particularly suitable as homologous 
recombination is routinely used to insert and delete 
sequences for the vaccinia virus genome. 

The above method optionally includes the step: 

(v) recovery of recombinant coronavirus comprising the 
modified replicase gene from the DNA from the recom- 
bining virus from step (iv). 

Methods for recovering recombinant coronavirus, such as 
recombinant IBV, are known in the art (See Britton et al 
(2005) see page 24; and PCT/GB2010/001293). 

For example, the DNA from the recombining virus from 
step (iv) may be inserted into a plasmid and used to transfect 
cells which express cytoplasmic T7 RNA polymerase. The 
cells may, for example be pre-infected with a fowlpox virus 
expressing T7 RNA polymerase. Recombinant coronavirus 
may then be isolated, for example, from the growth medium. 

When the plasmid is inserted into the vaccinia virus 
genome, an unstable intermediate is formed. Recombinants 
comprising the plasmid may be selected for e.g. using a 
resistance marker on the plasmid. 

Positive recombinants may then be verified to contain the 
modified replicase gene by, for example, PCR and sequenc- 
ing. 
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Large stocks of the recombining virus including the 
modified replicase gene (e.g. recombinant vaccinia virus, 
(rVV) may be grown up and the DNA extracted in order to 
carry out step (v)). 

Suitable reverse genetics systems are known in the art 
(Casais et al (2001) J. Virol 75:12359-12369; Casais et al 
(2003) J. Virol. 77:9084-9089; Britton et al (2005) J. Viro- 
logical Methods 123:203-211; Armesto et al (2008) Methods 
in Molecular Biology 454:255-273). 

Cell 

The coronavirus may be used to infect a cell. 

Coronavirus particles may be harvested, for example from 
the supernatant, by methods known in the art, and optionally 
purified. 

The cell may be used to produce the coronavirus particle. 

Thus the present invention also provides a method for 
producing a coronavirus which comprises the following 
steps: 

(i) infection of a cell with a coronavirus according to the 
invention; 

(i1) allowing the virus to replicate in the cell; and 

(iii) harvesting the progeny virus. 

The present invention also provides a cell capable of 
producing a coronavirus according to the invention using a 
reverse genetics system. For example, the cell may comprise 
a recombining virus genome comprising a nucleotide 
sequence capable of encoding the replicase gene of the 
present invention. 

The cell may be able to produce recombinant recombining 
virus (e.g. vaccinia virus) containing the replicase gene. 

Alternatively the cell may be capable of producing recom- 
binant coronavirus by a reverse genetics system. The cell 
may express or be induced to express T7 polymerase in 
order to rescue the recombinant viral particle. 

Vaccine 

The coronavirus may be used to produce a vaccine. The 
vaccine may by a live attenuated form of the coronavirus of 
the present invention and may further comprise a pharma- 
ceutically acceptable carrier. As defined herein, “pharma- 
ceutically acceptable carriers” suitable for use in the inven- 
tion are well known to those of skill in the art. Such carriers 
include, without limitation, water, saline, buffered saline, 
phosphate buffer, alcohol/aqueous solutions, emulsions or 
suspensions. Other conventionally employed diluents and 
excipients may be added in accordance with conventional 
techniques. Such carriers can include ethanol, polyols, and 
suitable mixtures thereof, vegetable oils, and injectable 
organic esters. Buffers and pH adjusting agents may also be 
employed. Buffers include, without limitation, salts prepared 
from an organic acid or base. Representative buffers include, 
without limitation, organic acid salts, such as salts of citric 
acid, e.g., citrates, ascorbic acid, gluconic acid, histidine- 
Hel, carbonic acid, tartaric acid, succinic acid, acetic acid, or 
phthalic acid, Iris, trimethanmine hydrochloride, or phos- 
phate buffers. Parenteral carriers can include sodium chlo- 
ride solution, Ringer’s dextrose, dextrose, trehalose, 
sucrose, and sodium chloride, lactated Ringer’s or fixed oils. 
Intravenous carriers can include fluid and nutrient replen- 
ishers, electrolyte replenishers, such as those based on 
Ringer’s dextrose and the like. Preservatives and other 
additives such as, for example, antimicrobials, antioxidants, 
chelating agents (e.g., EDTA), inert gases and the like may 
also be provided in the pharmaceutical carriers. The present 
invention is not limited by the selection of the carrier. The 
preparation of these pharmaceutically acceptable composi- 
tions, from the above-described components, having appro- 
priate pH isotonicity, stability and other conventional char- 
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acteristics is within the skill of the art. See, e.g., texts such 
as Remington: The Science and Practice of Pharmacy, 20th 
ed, Lippincott Williams & Wilkins, pub!., 2000; and The 
Handbook of Pharmaceutical Excipients, 4.sup.th edit., eds. 
R. C. Rowe et al, APhA Publications, 2003. 

The vaccine of the invention will be administered in a 
“therapeutically effective amount”, which refers to an 
amount of an active ingredient, e.g., an agent according to 
the invention, sufficient to effect beneficial or desired results 
when administered to a subject or patient. An effective 
amount can be administered in one or more administrations, 
applications or dosages. A therapeutically effective amount 
of a composition according to the invention may be readily 
determined by one of ordinary skill in the art. In the context 
of this invention, a “therapeutically effective amount” is one 
that produces an objectively measured change in one or 
more parameters associated Infectious Bronchitis condition 
sufficient to effect beneficial or desired results. An effective 
amount can be administered in one or more administrations. 
For purposes of this invention, an effective amount of drug, 
compound, or pharmaceutical composition is an amount 
sufficient to reduce the incidence of Infectious Bronchitis. 
As used herein, the term “therapeutic” encompasses the full 
spectrum of treatments for a disease, condition or disorder. 
A “therapeutic” agent of the invention may act in a manner 
that is prophylactic or preventive, including those that 
incorporate procedures designed to target animals that can 
be identified as being at risk (pharmacogenetics); or in a 
manner that is ameliorative or curative in nature; or may act 
to slow the rate or extent of the progression of at least one 
symptom of a disease or disorder being treated. 

The present invention also relates to a method for pro- 
ducing such a vaccine which comprises the step of infecting 
cells, for example Vero cells, with a viral particle comprising 
a replicase protein as defined in connection with the first 
aspect of the invention. 

Vaccination Method 

The coronavirus of the present invention may be used to 
treat and/or prevent a disease. 

To “treat” means to administer the vaccine to a subject 
having an existing disease in order to lessen, reduce or 
improve at least one symptom associated with the disease 
and/or to slow down, reduce or block the progression of the 
disease. 

To “prevent” means to administer the vaccine to a subject 
who has not yet contracted the disease and/or who is not 
showing any symptoms of the disease to prevent or impair 
the cause of the disease (e.g. infection) or to reduce or 
prevent development of at least one symptom associated 
with the disease. 

The disease may be any disease caused by a coronavirus, 
such as a respiratory disease and and/or gastroenteritis in 
humans and hepatitis, gastroenteritis, encephalitis, or a 
respiratory disease in other animals. 

The disease may be infectious bronchitis (IB); Porcine 
epidemic diarrhoea; Transmissible gastroenteritis; Mouse 
hepatitis virus; Porcine haemagglutinating encephalomyeli- 
tis; Severe acute respiratory syndrome (SARS); or Blue- 
comb disease. 

The disease may be infectious bronchitis. 

The vaccine may be administered to hatched chicks or 
chickens, for example by eye drop or intranasal administra- 
tion. Although accurate, these methods can be expensive e.g. 
for large broiler flocks. Alternatives include spray inocula- 
tion of administration to drinking water but it can be difficult 
to ensure uniform vaccine application using such methods. 


10 


15 


20 


25 


30 


35 


40 


45 


50 


55 


60 


65 


46 


The vaccine may be provided in a form suitable for its 
administration, such as an eye-dropper for intra-ocular use. 

The vaccine may be administered by in ovo inoculation, 
for example by injection of embryonated eggs. In ovo 
vaccination has the advantage that it provides an early stage 
resistance to the disease. It also facilitates the administration 
of a uniform dose per subject, unlike spray inoculation and 
administration via drinking water. 

The vaccine may be administered to any suitable com- 
partment of the egg, including allantoic fluid, yolk sac, 
amnion, air cell or embryo. It may be administered below the 
shell (aircell) membrane and chorioallantoic membrane. 

Usually the vaccine is injected into embryonated eggs 
during late stages of embryonic development, generally 
during the final quarter of the incubation period, such as 3-4 
days prior to hatch. In chickens, the vaccine may be admin- 
istered between day 15-19 of the 21-day incubation period, 
for example at day 17 or 18. 

The process can be automated using a robotic injection 
process, such as those described in WO 2004/078203. 

The vaccine may be administered together with one or 
more other vaccines, for example, vaccines for other dis- 
eases, such as Newcastle disease virus (NDV). The present 
invention also provides a vaccine composition comprising a 
vaccine according to the invention together with one or more 
other vaccine(s). The present invention also provides a kit 
comprising a vaccine according to the invention together 
with one or more other vaccine(s) for separate, sequential or 
simultaneous administration. 

The vaccine or vaccine composition of the invention may 
be used to treat a human, animal or avian subject. For 
example, the subject may be a chick, chicken or mouse (such 
as a laboratory mouse, e.g. transgenic mouse). 

Typically, a physician or veterinarian will determine the 
actual dosage which will be most suitable for an individual 
subject or group of subjects and it will vary with the age, 
weight and response of the particular subject(s). 

The composition may optionally comprise a pharmaceu- 
tically acceptable carrier, diluent, excipient or adjuvant. The 
choice of pharmaceutical carrier, excipient or diluent can be 
selected with regard to the intended route of administration 
and standard pharmaceutical practice. The pharmaceutical 
compositions may comprise as (or in addition to) the carrier, 
excipient or diluent, any suitable binder(s), lubricant(s), 
suspending agent(s), coating agent(s), solubilising agent(s), 
and other carrier agents that may aid or increase the delivery 
or immunogenicity of the virus. 

The invention will now be further described by way of 
Examples, which are meant to serve to assist one of ordinary 
skill in the art in carrying out the invention and are not 
intended in any way to limit the scope of the invention. 


EXAMPLES 


Example 1—Generation of an IBV Reverse 
Genetics System Based on M41-CK 


A M41-CK full-length cDNA was produced by replace- 
ment of the Beaudette cDNA in the Vaccinia virus reverse 
genetics system previously described in PCT/GB2010/ 
001293 (herein incorporated by reference) with synthetic 
cDNA derived from the M41 consensus sequence. 

The IBV cDNA within recombinant Vaccinia virus (rVV) 
rVV-BeauR-Rep-M41 structure described in Armesto, 
Cavanagh and Britton (2009). PLoS ONE 4(10): e7384. 
doi:10.1371/journal.pone.0007384, which consisted of the 
replicase derived from IBV Beaudette strain and the struc- 
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tural and accessory genes and 3' UTR from IBV M41-CK, 
was further modified by replacement of the Beaudette 5' 
UTR-Nsp2-Nsp3 sequence with the corresponding sequence 
from IBV M41-CK. The resulting IBV cDNA consisted of 
5' UTR-Nsp2-Nsp3 from M41, Nsp4-Nsp16 from Beaudette 
and the structural and accessory genes and 3' UTR from 
M41. This cDNA was further modified by the deletion of the 
Beaudette Nsp4-Nsp16 sequence. The resulting cDNA, 
lacking Nsp4-16, was modified in four further steps in which 
the deleted Nsps were sequentially replaced with the corre- 
sponding sequences from M41-CK, the replacement cDNAs 
represented M41-CK Nsp4-8, Nsp9-12, Nsp12-14 and 
finally Nsp15-16. Each replacement cDNA contained 
approx. 500 nucleotides at the 5' end corresponding to the 3' 
most M41 sequence previously inserted and approx. 500 
nucleotides at the 3' end corresponding to the M41 S gene 
sequence. This allowed insertion of the M41 cDNA 
sequence by homologous recombination and sequential 
addition of contiguous M41 replicase gene sequence. The 
synthetic cDNAs containing the M4l-derived Nsp 
sequences were added by homologous recombination utilis- 
ing the inventor’s previous described transient dominant 
selection (IDS) system (see PCT/GB2010/001293). The 
M41-derived cDNAs containing sequence corresponding to 
the M41 Nsps-10, -14, -15 and -16 contained the modified 
amino acids at positions 85, 393, 183 and 209, respectively, 
as indicated in FIG. 10. 

A full-length cDNA representing the genome of M41-CK 
was generated in Vaccinia virus representing the synthetic 
sequences. Two rIBVs, M41-R-6 and M41-R-12, were res- 
cued and shown to grow in a similar manner as M41-CK 
(FIG. 1). 


Example 2—Determining the Pathogenicity of 
Rescued M41 Viruses 


The viruses rescued in Example 1 were used to infect 
8-day-old specific pathogen free (SPF) chicks by ocular and 
nasal inoculation to test them for pathogenicity, as observed 
by clinical signs on a daily basis 3-7 days post-infection and 
for ciliary activity days 4 and 6 post-infection. Loss of 
ciliary activity is a well-established method for determining 
the pathogenicity of IBV. The two M41-R viruses were 
found to be apathogenic when compared to M41-CK though 
they did show some clinical signs in comparison to unin- 
fected control chicks (FIG. 2) and some but inconsistent loss 
in ciliary activity (FIG. 3). 

Thus, the M41-R molecular clones of M41-CK were not 
pathogenic when compared to the parental virus M41-CK. 

The inventors identified several nucleotide differences in 
the M41-R compared to the M41-CK sequences. The major- 
ity of these were synonymous mutations, as the nucleotide 
change did not affect the amino acid sequence of the protein 
associated with the sequence. However, four non-synony- 
mous mutations were identified in the IBV replicase gene 
specific to Nsp-10, Nsp-14, Nsp-15 and Nsp-16 components 
of the replicase gene, these mutations resulted in amino acid 
changes (Table 3). 


TABLE 3 


Non-Synonymous mutations identified in the Nsps of M41-R 
full-length genome 


Region of Nucleotide Nucleotide 

Replicase position Mutation Amino Acid Change 
Nsp10 12137 CT ProLeu 
Nsp14 18114 GC ValLeu 
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TABLE 3-continued 


Non-Synonymous mutations identified in the Nsps of M41-R 
full-length genome 


Region of Nucleotide Nucleotide 

Replicase position Mutation Amino Acid Change 
Nsp15 19047 ToA Leulle 
Nsp16 20139 GA Val—lle 


Example 3—Repair of M41-R rIBVs 


In order to determine whether the identified mutations 
were responsible for the loss of pathogenicity associated 
with M41-R, the Nsp10 mutation was repaired and the 
mutations in Nsp-14, -15 & -16 were repaired and shown to 
grow in a similar manner as M41-CK (FIG. 9). The inven- 
tors thus generated the rIBVs, M41R-nsp10rep and M41R- 
nsp14, 15, 16rep, using synthetic cDNAs containing the 
correct nucleotides utilising the inventor’s previous 
described (TDS) system (see PCT/GB2010/001293). 

The rIBVs were assessed for pathogenicity in chicks as 
described previously. Both rIBVs showed increased patho- 
genicity when compared to M41-R but not to the level 
observed with M41-CK (FIGS. 4 and 5). M41R-nsp14, 15, 
l6rep gave more clinical signs and more reduction in ciliary 
activity than M41R-nsp10rep, overall these results indicated 
that the changes associated with the four Nsps appear to 
affect pathogenicity. 

To determine the roles of the Nsps in pathogenicity the 
full-length cDNA corresponding to M41R-nsp10rep was 
used to repair the mutations in Nsps14, 15 & 16 using a 
synthetic cDNA containing the correct nucleotides utilising 
the TDS system. 

The following rIBVs were produced: 

M41R-nsp10, 15rep—M41-R with the mutations in Nsp- 
10 and Nsp-15 repaired 

M41R-nsp10, 14, 15rep—M41-R with mutations in Nsp- 
10, -14 and -15 repaired 

M41R-nsp10, 14, 16rep—M41-R with mutations in Nsp- 
10, -14 and -16 repaired 

M41R-nsp10, 15, 16rep—M41-R with mutations in Nsp- 
10, -15 and -16 repaired 

M41-K—AIl four mutations, Nsp-10, -14, -15 & -16 
repaired in M41-R 

The rIBVs were shown to grow in a similar manner as 
M41-CK (FIG. 9) and assessed for pathogenicity as 
described previously. M41-K (in which all four mutations 
had been repaired) resulted in clinical signs and 100% loss 
of ciliary activity (complete ciliostasis) by 4 days post- 
infection (FIGS. 6, 7 & 8). The other rIBVs demonstrated 
varying levels of pathogenicity, apart from M41R-nsp10, 15, 
16rep, which was essentially apathogenic. These results 
confirmed that repair of all four Nsps restored pathogenicity 
to M41-R; again supporting the previous evidence that the 
mutations described in the four Nsps are implicated in 
attenuating M41-CK. 

The inventors also generated rIBV M41R-nsp 10, 14 rep 
(nsp 10 and 14 are repaired, nsp 15 and 16 contain muta- 
tions) and rIBV M41R-nsp 10, 16 rep (nsp 10 and 16 are 
repaired, nsp 14 and 15 contain mutations) and assessed the 
pathogenicity of these viruses. 

rIBV M41R-nsp 10, 14 rep less pathogenic than M41-K 
but caused around 50% ciliostasis on days 4-6 post-infec- 
tion. rIBV M41R-nsp 10, 16 rep was almost apathogenic and 
caused no ciliostasis (see FIG. 11a-c). 
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Thus the genome associated with M41-R is a potential 
backbone genome for a rationally attenuated IBV. 


Example 4—-Vaccination/Challenge Study with 
M41-R 


Candidate vaccine viruses were tested in studies in which 
fertilized chicken eggs were vaccinated in ovo at 18 days 
embryonation and in which the hatchability of the inoculated 
eggs was determined. The clinical health of the chickens was 
investigated and the chickens were challenged at 21 days of 
age with a virulent IB M41 challenge virus at 10°°° EID,, 
per dose. 

Clinical signs were investigated after challenge protection 
by the vaccine and a ciliostasis test was performed at 5 days 
after challenge to investigate the effect of the challenge 
viruses on movement of the cilia and protection by the 
vaccine against ciliostasis (inhibition of cilia movement). 

In Ovo Vaccination in Commercial Broiler Eggs 

The design of the experiment is given in Table 4 and the 
clinical results are given in Table 5. Hatchability of the eggs 
inoculated with IB M41-R was good and chickens were 
healthy. IB M41-R protected against clinical signs after 
challenge in the broilers (placebo: 19/19 affected, 1B M41- 
R: 3/18 affected and 1 dead). The results of the ciliostasis 
test are given in Table 6. IB M41-R generated protection 
against ciliostasis. 


TABLE 4 


Design of a hatchability, safety, efficacy study in commercial eggs 
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TABLE 6 


Results of the ciliostasis test after challenge, for design see Table 1. 


Treatment Protected/total Percentage protection 
Saline 0/19 0% 
IB M41R 5/18 28% 


In Ovo Vaccination in Specific Pathogen-Free (SPF) Eggs 


The design of the study in SPF eggs is given in Table 7 
and is similar with the design of the studies with commercial 
broilers, but the vaccination dose for 1B M41-R was higher, 
(10° EID, per dose). 


The results (Table 8) show that the hatch percentage for 
IB M41-R hatch was low, and 19 of 40 hatched and the 
chicks were weak. Eight chicks died. The remaining 11 
chickens were challenged and 11 of the chicks hatched from 
the eggs which had been inoculated with saline were chal- 
lenged. 


In the ciliostasis test after challenge it appeared that all 
chickens vaccinated in ovo with IB M41-R were protected, 
whereas none of the controls was protected, see Table 9. 


EIDso! Route Day(s) Day(s) End Nr. of 

Treatment per of of of of eggs per 
Treatment Description dose Admin Admin Challenge? Study treatment 
TO None NA NA NA NA NA 30 
TO2 IB M41-R 104 In ovo 18 days At 21 days At 26 30 
NTX Saline NA In ovo embryo- of age, 20 days 30 

nation chickens of age 
per group 
'Dose volume 0.1 ml, NA, not applicable. 
*0° EIDso per dose. 
TABLE 5 


Hatch percentages and clinical data before and after 


challenge in commercial chickens, for design see Table 1. 


Before After 
challenge challenge 
Hatch/ Vital/ Deaths/ Symptoms/ Deaths/ Symptoms/ 
Treatment total total total total total total 
None 28/30 Euthanized directly after hatch for blood collection 
IB M41-R 28/30 28/28 1/20 0/19 1/19 3/187 
Saline 29/30 29/29 1/20 o/19 O/19 19/191 2-3-4 5 6 7 


Disturbed respiratory system 
?Whizzing 

3Change of voice 

‘Breathing difficult 

Swollen intra-orbital sinuses 
Uneven growth 

7Weak 
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Design of a hatchability, safety, efficacy study in SPF eggs 


EIDs9! Route Day Day End 
Treatment per of of of of 
Treatment Description dose Admin Admin Challenge? — Study 
TOL IB M41-R 105 In ovo 18 days At 21 days At 26 
embryo- of age days 
T04 Saline NA In ovo _ nation of age 
NTX NA NA NA NA 
'Dose volume 0.1 ml, NA, not applicable. 
2Challenge dose 103 ElDso in 0.2 ml. 
TABLE 8 


Hatch percentages and clinical data before and after 


Nr. of 


eggs per 
treatment 


40 


40 
10 


challenge in SPF chickens, for design see Table 7. 


Before After 
challenge challenge 
Hatch/ Vital/ Deaths/ Symptoms/ _—_Deaths/ Symptoms/ 
Treatment total total total total total total 
IB M41-R 19/40 11/40 8/40 weak 0 0 
Saline 30/40 30/40 0 _ 0 0 
NA 9/10 9/10 0 — — _— 
TABLE 9 the batch of SPF eggs for viruses, as in other studies the level 


Results of the ciliostasis test after challenge, for design see Table 7. 


Treatment Protected/total Percentage protection 
Saline O/11 0% 
IB M41R 11/11 100% 


In conclusion, IB M41-R was safe in commercial eggs, 
generated protection against clinical signs and to an extent 
against ciliostasis. 


In SPF eggs vaccinated with IB M41 R a relatively low 
number of chickens hatched. This may be due to the 10° 
EID., per egg of 1B M41-R used. This was 10-fold higher 
than the dose used in earlier studies in which there was a 
higher level of hatchability. The lower hatch percentages 
may also be caused by a particularly high susceptibility of 
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of embryo mortality was also higher that had previously 
been observed. 

After challenge all surviving chickens after hatch were 
completely protected against ciliostasis. It is concluded that 
IB M41-R has great potential as vaccine to be administered 
in ovo. 

All publications mentioned in the above specification are 
herein incorporated by reference. Various modifications and 
variations of the described methods and system of the 
invention will be apparent to those skilled in the art without 
departing from the scope and spirit of the invention. 
Although the invention has been described in connection 
with specific preferred embodiments, it should be under- 
stood that the invention as claimed should not be unduly 
limited to such specific embodiments. Indeed, various modi- 
fications of the described modes for carrying out the inven- 
tion which are obvious to those skilled in molecular biology, 
virology or related fields are intended to be within the scope 
of the following claims. 


SEQUENCE LISTING 


<160> NUMBER OF SEQ ID NOS: 13 
<210> 
<211> 
<212> 


<213> 


SEQ ID NO 1 

LENGTH: 27500 
TYPE: DNA 
ORGANISM: Infectious bronchitis virus 


<400> SEQUENCE: 1 


acttaagata gatattaata tatatctatc acactagect tgcgctagat ttccaactta 60 
acaaaacgga cttaaatacc tacagctggt cctcataggt gttccattge agtgcacttt 120 
agtgecctgg atggcacctg gecacctgte aggtttttgt tattaaaate ttattgttge 180 
tggtatcact gettgttttg ccgtgtctca ctttatacat ccgttgcttg ggctacctag 240 
tatccagegt cctacgggeg cegtggctgg ttcgagtgcg aagaacctct ggttcatcta 300 


geggtaggeg 
ggggtcacct 
egcataaggt 
aggttctggt 
ctaaaacagg 
gaacaacttt 
gettttgcag 
gaaactgtgt 
gcaaaagtct 
cectttgcaa 
gaaacactgg 
tggcttcagg 
gaagtaactg 
tttaaacaac 
gaattaccac 
actgttgtgg 
agcattaatg 
aaaattttca 
ccaaatgcac 
gttgttgtge 
gtggagtcag 
ggtgateget 
tgtgagegtg 
ettgcagcaa 
gaaccattta 
acattagcga 
aaagtagaag 
tatggaaaag 
gegattgtga 
atctataaag 
ggtttttgtg 
aaaggagttg 
agttttaaga 
gtgcacaaaa 
gttgaagatec 
acacttccag 
atgttcttecec 
gctattaatg 
caagaaatac 


ccatggaata 


ggtgtgtgga 
ecccccacat 
eggctatacg 
ggtgtttagt 
gagtatctcce 
gtgacgettt 
ttaggcagaa 
gtggtetectt 
taaaagecac 
gaaaatatcg 
atgctegtge 
tggcagcaaa 
caaaagtcat 
aaatagtcag 
agegtattge 
ttatggagag 
gtgctgttge 
ctacacttge 
egagaggcac 
gtggcatgeg 
aaggttggtc 
tttacgegge 
ttgtectgtct 
tetactctte 
agtttctggg 
aggctgctac 
atgtttggtec 
tgcgtaatct 
ttttagegac 
taggtggtct 
cacagttgaa 
cacagcattg 
agtgtgcact 
ttattcaaga 
tgggtgttgt 
agaaccaacc 
gcttcaaaaa 
tggtttgcaa 
caccacctga 


caatcttcaa 
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agtagcactt 
acctctaagg 
acgtttgtag 
gagcagacat 
caaactaagg 
gtttttctat 
gtttgategt 
ectcttgaag 
ttctaagttg 
tgaacttttg 
acaaactctt 
aatccaagtt 
ggatgetttg 
aatttttcaa 
agcacttaag 
gactctagtt 
aaaattcttt 
ettctttagg 
taaagggttt 
aaatgactta 
tgeaattttg 
acctctttca 
ttetgatggt 
ttttagtgtc 
tcataaattt 
tattgcagat 
ttcacttact 
cgaagaattt 
agtgcttgga 
ttttactaaa 
aagagctaag 
ttttcaacta 
tggtagaatc 
gggegatgaa 
tcaagaaaaa 
ceggtcatatg 
ggatgagaac 
ageaggeggt 
tgttgtgttt 


aaaggcttat 
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cagacgtace 
gettttgage 
ggggtagtge 
acaatagaca 
gatgtcatte 
acgtcacaca 
aatctgcaga 
ggagttgaca 
gcagatttag 
aagacagcat 
gatgaaattt 
teggctatgg 
ggctcaaata 
aaagegctgg 
atggcttttg 
gttagagagt 
gaagaactcec 
gaggctgceag 
gaagtcgttg 
acactgcttg 
ggtggacatec 
ggaaattttg 
gtaacaccgg 
gcagaacttg 
gtgtatgcaa 
gttttgaage 
gaaaagtctt 
gttaagactt 
gagggcattt 
gttgttgact 
ctcattgtca 
ttgctggatg 
catggagact 
atttggtttg 
ttgattgatt 
gttcaaatcg 
atttattata 
aaaactgtca 
attaaggtta 


aaggagecca 


-continued 


ggttetgttg 
ctagegttgg 
caaacaacce 
gtgacaacat 
ttgtatccaa 
acectaagga 
ctgggaaaca 
aaataacacc 
aagacatctt 
gecagtggtc 
ttgaccctac 
egatgegcag 
tgagtgetct 
ctatttttga 
ctaagtgtge 
tegcaggaac 
caaatggttt 
tgaaaattgt 
gtaatgcecaa 
accaaaaage 
tttgetatgt 
cattgcatga 
agataaatga 
tggcagecat 
aggatgcage 
tgtttcaatec 
ttgaattctg 
gtttttgtaa 
ggcatcttgt 
tttgtgaaaa 
ctgaaaccct 
caatacagtt 
tgetcttctg 
acgcecattga 
ttgatgtttg 
aggatgacgg 
caccaatgtc 
ectttggaga 
gceattgagtg 


ttgaagtaga 


tgtgaaatac 
gctacgttct 
ctgaggtgac 
ggcttcaage 
agacattcct 
ttacgctgat 
gttcaaattt 
tggegtccca 
tggtgtetect 
tettactgta 
tgaaatactt 
gettgttgga 
tttccagatt 
gaatgtgagt 
caagtccatt 
ttgtettgea 
catgggtget 
ggataacata 
aggtacacaa 
tgaaattcct 
ctttaagagt 
tgtgeattgt 
tggacttatt 
taaaaggggt 
agtttctttt 
agegegtgtg 
gaggettgca 
ggctcaaatg 
ttcegcaagtc 
atattggaaa 
etgtgttttg 
tatgtataaa 
gaaaggaggt 
tagtattgat 
tgataatgtg 
aaagaactac 
acagettggt 
aactactgtg 
ttgtggtgaa 


gacagacctec 


360 


420 


480 


540 


600 


660 


720 


780 


840 


900 


960 


1020 


1080 


1140 


1200 


1260 


1320 


1380 


1440 


1500 


1560 


1620 


1680 


1740 


1800 


1860 


1920 


1980 


2040 


2100 


2160 


2220 


2280 


2340 


2400 


2460 


2520 


2580 


2640 


2700 
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acagttgaac 


ceggaggctec 
gatttggatt 
atcgaggaag 
actaattcag 
ceggcaagta 
attgttcata 
gtcaataact 
ggtgactggg 
gtcatatctg 
getattgttg 
tttacacctg 
etcatttctg 
gagcaagage 
gttaaaccag 
ttggetgttg 
gcaaatgage 
ceggactttg 
gtcacacctt 
ggagacagca 
gtggttaact 
tcaatagatg 
etgagtcaag 
acggaggatg 
tttggacagg 
gaaatectct 
caaaagtatg 
aattttgtta 
ettcaagcetg 
ggtggagatec 
gatttttcag 
acaaatgcac 
aggggtcttg 


caatattcaa 


ttacegtact 


gctgtaggga 


gatggtaagg 


attacagcaa 


catagtaagg 


aattgctctc 
cagaaccacc 
gcataaaatc 
aagatgcaga 
aatgtgaaga 
acaaatatcc 
aggacgctct 
getttgaagg 
gtgaggetgt 
agaaatcagt 
aagagcagga 
cagacctaga 
etgtccectaa 
etacectagt 
ctacatgtga 
taattgeocaa 
acatgtcgca 
ttgaatattg 
catttgttaa 
acttgegtga 
atgttgtgcec 
ctatgegega 
aacacatcga 
gtgttaaata 
tttttgcaag 
ttatacccac 
taacatattt 
tattagagtg 
ctaaaattag 
ctacagactt 
atgctaattg 
ttcttaagaa 
aagectgtat 
attgceccaac 
tattgetttt 
etgttgtttt 
cttttgacaa 
tgtatacacg 


gtaaagctaa 


mF! 


tgtggtctat 
accatttgag 
atgccatctg 
agaatgtgac 
agaagatgag 
tetgectett 
egatgttgtg 
ggctgttaaa 
tgatgcegcaa 
tgagggtttt 
aatagtacct 
agttgttaaa 
agaagaagtt 
tgttaaagca 
aaaacccaaa 
agcattggat 
tggtggtgge 
egeggactat 
aggcattcaa 
gaagettgtt 
agttctctca 
agcttttaaa 
ttatttcgat 
cegetctgtt 
aaataaggta 
aactgacaag 
gcaaacgett 
gegtgacgga 
atttaaaggt 
tgttgectgg 
gettttggec 
gtgtgtgtcg 
tcagecagtt 
etgtggtgca 
tgctactgat 
cattggctct 
tettgctaag 


tttttctctt 


agtagtaaaa 
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gagaaaatgt 
aatgtcacac 
atctatcgtg 
acggattcag 
gatactaaag 
gatgatgatt 
aatttaccat 
getcttcege 
gaacaattgt 
actggtagtt 
gttgttgaac 
gaaacagcag 
gtgtctcagg 
caacgtgaga 
tttttggagt 
gagtttaaag 
gttgcaaagg 
gttaagaaac 
tgtgtgaata 
getgcttaca 
tcagggattt 
ggttgtgcca 
gcaacttgta 
gttttaaaac 
gtcttttegg 
actattcttg 
gegcagaaat 
aattgctgga 
tttcttgcag 
tgttatgcaa 
aatttagcag 
tgcaattgtg 
egagcaccta 
agtagtacgg 
ggtectgcta 
actaatagtg 
gatagaaaat 
aggagtgaaa 


gaagatgttt 


-continued 


gtgatgatct 
ttgttgataa 
attatgagag 
gtgatgctga 
tgttggetct 
atagegtcta 
ctggtgaaga 
agaaagttat 
gtcaacaaga 
gtgatgcaat 
aaagtcagga 
aagaggttga 
agaaagagga 
agaaggctaa 
acaaaacatg 
agttctgcat 
caattgcaga 
atggtccaca 
atgttgtagg 
agagtgttct 
ttggtgtaga 
tacgegttct 
agcagaagac 
etggtgattec 
etgatgatgt 
aatattatgg 


gggatgttca 


ttagttcage 


aagcatggge 
gttgcaatge 
aacattttga 
gtgttaagag 
atcttctaca 
atgaagtaat 
cagttgattg 
gecattgtta 
ttggaaggaa 
atcccctact 


ctaaccttge 


caagetgttt 
gaatggtaaa 
egatgatgac 
ggagtgtgac 
tatacaagac 
caatggatgt 
aacctttgtt 
tgatgttcta 
atcaactcgg 
ggctgaacaa 
tgtagttgtt 
tgagtttatt 
gecacaggtt 
aaagttcaaa 
tgtgggtgat 
tgtaaacget 
ettttgtgga 
gcaaaaactt 
acctcgecat 
tgtaggtgga 
ttttaaaata 
tttattttct 
aatttatctt 
tttgggtcaa 
tgaggataaa 
tttagatgeg 
atatagagac 
aatagttcte 
taaactgttg 
taaagtaggt 
egcagattac 
ttatgaactt 
ttttaaaacg 
agaagcttca 
tgatgaaaat 
tacacaagec 
gtegecttac 
tgttgttgaa 


tactagttct 


2760 


2820 


2880 


2940 


3000 


3060 


3120 


3180 


3240 


3300 


3360 


3420 


3480 


3540 


3600 


3660 


3720 


3780 


3840 


3900 


3960 


4020 


4080 


4140 


4200 


4260 


4320 


4380 


4440 


4500 


4560 


4620 


4680 


4740 


4800 


4860 


4920 


4980 


5040 


56 


aaagecagtt 
agtcttaaag 
gaagattcta 
aggtctaagg 
ccagtctact 
gctaattttg 
tgggaaaatg 
etttggegtg 
aaagctattg 
gctacatacg 
aagggtcttt 
aagacacttg 
agctataaga 
tacacaagta 
tetttatgtg 
geaggtgatt 
catgcttata 
aattggettt 
atttgttatt 
tttctagatt 
tatttetgge 
gtaacatgtg 
gttggtggac 
agacataatt 
ectgaagttg 
gettaccacg 
getgcaattec 
tttttaaaga 
ggttttatag 
gtctattatg 
caattaatag 
aatataatat 
etgetttcta 
gaagtggaat 
gataagttga 
ttaagagtca 
gacagttgec 
acaaagtctg 
aaggcaggtg 


tttgtctttt 


ttgacgatct 
tgcaggagac 
agttgecact 
atggttttac 
acccagtctt 
ttgttgggca 
cegagagett 
cagaacacct 
ttggatctag 
ttgccgataa 
gtggattcac 
tgttcttttt 
ttgtgttatg 
atccagtagt 
gtccttataa 
ttacttgtcg 
gegtagaaca 
atttggtctt 
gtgttaagta 
ggtttgtaaa 
tettttacaa 
aagtgtgcaa 
gcaagcaaat 
ggtattgtag 
etggegaget 
ttgtgtatga 
etggtaagga 
aagetgtttt 
tgtgtaatac 
egcaatatct 
tagagectgt 
ctgtagatac 
ttactaaaga 
acactggtga 
cacctegtga 
aaaatgctce 
ttaaatattt 
gtgctaaaca 
gtgttattaa 


atatactttt 
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tactgacttt 
acctgataat 
gacacttaaa 
ttataagtta 
ggattctatt 
tcecaaattat 
tgttaaaatg 
taataaacct 
tgttgttact 
agtaggtgat 
acgtggecat 
ettttatttc 
taaggtggtg 
gtttactgga 
tgactacggt 
tgtgtgttta 
aatttataag 
tctaatatta 
tttggtattg 
aacagttttt 
gatatacgta 
gagagttgca 
agtgeatgtt 
aaattgtgat 
ttctgaaaag 
ggcatgegtg 
taatgeatct 
tcttaaggag 
acagagtgcg 
gtgtaageca 
gtctaagagt 
tgcagcttta 
cgaagaagec 
eggttttact 
tagagggttt 
teceggtagta 
aatttcagcet 
agttatttct 


taacactttt 


tacagcatgt 
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gaacagtggt 
cttgatgaat 
gttagaggta 
acacctgata 
agtcttaggg 
tatagtaagt 
ggttataaaa 
aatttggaga 
acgcagtgtg 
ggtgtagtte 
tttgaaaaga 
ttaaaggcta 
tttgctacct 
atacgtgtge 
aaagattctt 
catgatagag 
gatgcagcett 
tttgttaage 
agttcaactg 
acccatttta 
caagtgcatec 
egcagcaaca 
tacactaatt 
gattatggtc 
ettaagegec 
gttgatgatt 
tetgcetgtta 
gcattgaaat 
catgcactag 
atacttatac 
gttatagata 
aattataagg 
gtagatatgg 
aatgtgatac 
ttgataaatg 
tggaagtttt 
actgtcaagt 
tgtcatacce 
aaatggttta 


tgtttgggtt 


-continued 


atgatagcaa 
atgtgtcatt 
tcaaatcagt 
ctgatgaaaa 
caatatgggt 
ctctccgaat 
ttgatggtgt 
gaatttttaa 
gtaaaatact 
gcaatattac 
aaatgtcect 
gtgctaagag 
tacttatagt 
tagacttcct 
ttgatgtgtt 
attcacttca 
ctggcattaa 
cagtggcagg 
tgttgcaaac 
attttatggg 
atatattgta 
ggcaagaggt 
ctggctataa 
accaaaatac 
atgttaaacc 
ttgttaattt 
agtgtttcag 
gtgaacaaat 
aggaagcaaa 
ttgaccagge 
aagtgtgtag 
caggcacact 
ctatcttctg 
egtcatatgg 
cagatgcettc 
ctgatcttat 
caggaggtcg 
agaaactgtt 
tgagttgttt 


actactatat 


catctatgag 
tacgacaaag 
tgttgacttt 
ttcaaaaaca 
tgaaggcagt 
teccacgttt 
aactatggge 
cattgctaag 
agttaaagca 
agatagaatt 
acaatttcta 
tttagtttct 
gtggtttata 
atttgaaggt 
acgctattgt 
tetgtacaaa 
etttaattgg 
ttttgttatt 
tggtgtaggt 
agegggattt 
ctgtaaggat 
tagegttgta 
ettttgtaag 
atttatgtcc 
tacagcatat 
aaaatataag 
tgttacagat 
atctaatgat 
gaatgcagcc 
actttatgag 
cattttgtct 
tegtgatgcet 
ccacaatcat 
tatggacact 
tattgctaat 
taaattgtct 
tttctttata 
ggtagagaaa 
taaatggctt 


ggagatgaat 


5100 


5160 


5220 


5280 


5340 


5400 


5460 


5520 


5580 


5640 


5700 


5760 


5820 


5880 


5940 


6000 


6060 


6120 


6180 


6240 


6300 


6360 


6420 


6480 


6540 


6600 


6660 


6720 


6780 


6840 


6900 


6960 


7020 


7080 


7140 


7200 


7260 


7320 


7380 


7440 
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aaaagttttg 
gttatagaca 
tttgttaatt 
gttacagttg 
tgggttatgg 
attcctacct 
ggtagttttt 
aatacaccte 
ggtagtatta 
caacaaatac 
agtgtatgtg 
tttaatgatg 
atgtttaata 
gcaactatgt 
caaggtgttt 
aatgcatttg 
etctattget 
ettgttttta 
attctttata 
aagttgtatg 
gttattcgtg 
tatetttctg 
ttgcaagett 
gaggttgttt 
aaaaaactag 
ggcaataatec 
ggtaagttta 
gaagttgtaa 
gttttaattt 
gctaattgtg 
tacectgtca 
ggctcagttg 
gagttaccta 
gtagatgaag 
geatggetct 
ttggagagta 
actccatttt 
tgtaaactce 


ggacaatata 


ttcaccccat 
aaggtgttat 
ttgacgectt 
ttatagatgg 
atggtgttat 
ggtttaatag 
atacatctat 
aattgtattg 
ttectcatag 
tgcatacace 
agtatactaa 
aatacattag 
tggttagtac 
ttttaatact 
ttaaagctta 
ttttgtgtgt 
atgcatcatt 
cetttggttt 
tgtacacace 
atggcaacga 
gtactgaatt 
egtatgctag 
gtegtgcatg 
ataccccacec 
tttctcctag 
ttaatggact 
gtggtgacca 
ctcaaaatgg 
tacaaactge 
gtgatagttt 
etatgegtte 
gttttaatat 
atgcattaca 
aggttgegca 
atgcggcaat 
ctactgttte 
ecactagtac 
ttegceactat 


attttgaaga 


59 


gtatgatgta 
tagagagatt 
ttggggtaaa 
tgacgggaca 
gtttgtgeat 
agaaattgtt 
ageattattt 
ttttaatgge 
agtatacttec 
ctacatagtg 
accaggttac 
taaacctgge 
attctttact 
agttgttatt 
tgcgaccatt 
acatagttat 
ggttacaagt 
aatagtacce 
gttggttttc 
gtttgttggt 
tgttaagett 
acttaaatac 
gttagcttat 
gegttactct 
tagtgctgtt 
gtggetgggt 
gtggggtgac 
tgttactttg 
agttgccaat 
cactatagct 
taatggtact 
agaaaagggt 
cactggaact 
aagagtgcca 
tattagtgtt 
tattgaagat 
tgctattact 
tatggtaaaa 


cgaattgaca 
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aactccacac 
gtgtcagaag 
tcatatgaaa 
gtagctgttg 
atgacacaga 
ggttacactc 
tetgctagat 
gacaatgatg 
caacctaatg 
aagtttgttt 
tgtgtgtcac 
gttttctgtg 
ggtgtcaace 
gtgttaattt 
gtgtttacaa 
aatagtgttt 
egcaatactg 
acatggttgg 
tggtgttacg 
aattatgacec 
acgaatgaga 
tattcaggca 
getttggace 
attggtgtta 
gagaagtgca 
gattctattt 
gtactaaacc 
aatgttgtca 
gctgaaactc 
tgttcttatg 
attagagcat 
gtagttaatt 
gacctaatgg 
ccagataatc 
aaagaaagta 
tacaataggt 
aaattaagtg 
agtgctcaat 


ccagaatctg 


-continued 


tgcatgttga 
ataattgttt 
ataataaaaa 
gtgttecctgg 
ctgatcgtag 
aggattcaat 
gtttatattt 
cacctggage 
gtgttagget 
cagacagcta 
tagactccca 
gttctactgt 
ctaatattta 
ttgcaatggt 
taatgttagt 
tagetgttat 
ctataataat 
ettgttgceta 
gtactactaa 
ttgctgcgaa 
taggtgataa 
ctggtagtga 
aatatagaaa 
gtagactaca 
ttgttagtgt 
actgcccacg 
ttgctaataa 
geaggeggcet 
ctaagtataa 
gtggtacagt 
etttcctage 
tettttatat 
gtgagtttta 
tagttactaa 
gtttttcaca 
gggctagtga 
ctataactgg 
ggggtagtga 


tatttaatca 


agggttcaaa 
ctctaataag 
ctgtccaatt 
ttttgtatca 
accttggtac 
tatcactgag 
aacagecage 
ettaccattt 
tatagttcca 
ttgtagaggt 
atgggttttg 
tagagaactt 
tattcagcta 
tataaagttt 
ttgggttatt 
attattagta 
geattgttgg 
tetgggattt 
aaatactcgt 
gageactttt 
atttgaagec 
gcaagattac 
tagtggtgtt 
egetggtttt 
ctcttataga 
ccatgtgtta 
teatgagttt 
taaaggagca 
gtttgttaaa 
tataggactt 
aggagectgt 
gceaccatctt 
tggtggttat 
caatattgta 
acctaaatgg 
taatggtttt 
ggttgatgtt 


teccatttta 


agttggtggt 


7500 


7560 


7620 


7680 


7740 


7800 


7860 


7920 


7980 


8040 


8100 


8160 


8220 


8280 


8340 


8400 


8460 


8520 


8580 


8640 


8700 


8760 


8820 


8880 


8940 


9000 


9060 


9120 


9180 


9240 


9300 


9360 


9420 


9480 


9540 


9600 


9660 


9720 


9780 


60 


gttaggttac 
ttagettget 
ttttatgtac 
aaacatgtta 
ggagtttgtg 
ttaagecaat 
ttagtgttgt 
tetttgttaa 
actcttactg 
gtgttggcta 
tggattttat 
gttaatggca 
gtttttggtt 
atgtgtttge 
cttatacaag 
agtgatgtaa 
gcaaattcaa 
gatgttggag 
tetactattg 
teggttacte 
atttatgaaa 
gcatatcgta 
aagaagttag 
actgatagaa 
aaaatagatt 
ctagcaactg 
acgtgggtca 
gactgtgtta 
acttactgta 
gggcataata 
acaaaggctt 
tatacaagta 
aaagtagect 
tgtaaatttg 
aatacgaggt 
caatctaaag 
tttgcagtag 
ggtaactgtg 
ccaagtccaa 


gcacatatag 


agtettcttt 
tettgtttgt 
atgcagetgt 
tggcatacat 
etgaagtcce 
ggtatgatce 
acactgcettt 
tgectgtatca 
catatacaga 
atgttagtag 
attattgcaa 
taggetgget 
taaccttagg 
ataaggtaaa 
gaattggagg 
agtgtacaac 
aaatgcatge 
agtgcatgga 
atttgggtga 
aagagttttc 
aggttttage 
aagctgccaa 
atagcatgge 
gagcaaaatt 
etgagaaget 
ttccaattgt 
agtgtgtgga 
etgatgecga 
taagtggtga 
aggttgatgt 
gegtagcagg 
ttagtggcag 
ettttttgaa 
gtatgaaagt 
ctattgtaag 
gtcatgagac 
atectgegga 
ttaaaatgtt 
ectceggatca 


cacacectgg 


61 


tgtaagaaaa 
gttgtgtget 
tattttgttg 
ggacacttte 
tttcatatac 
tgtagtettt 
taagtgtgta 
gtttatgaag 
aggtaattgg 
taattcctta 
tgcaacatac 
ttgcacctgt 
taaatacaat 
tccacctaaa 
egategtgtg 
tgttgtttta 
ttatcttgtt 
taatttattg 
gtattgtgat 
gcacatacce 
cegattctaaa 
tattgcaaag 
agaacgtgct 
agtttcatca 
taatgtctta 
ttgtagtaat 
gggtgtgeat 
tggcacagag 
taatatagca 
tgecttgcaa 
tgtagatcaa 
ttcagttgta 
tgaggcaggt 
gggtgataag 
aggtatggta 
agaggaagtg 
tacatattgt 
gacagtacat 


ggattcttat 


eggagcagga 
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gctacatctt 
attgtcttat 
atggctgtge 
ctattgecta 
aatactctaa 
gatactatgg 
caaggctget 
ttaggttttg 
gagttattct 
attggtttaa 
tttaataatt 
tactttggat 
tttaaagttt 
actgtgtggg 
ttgcctatag 
atgcagcettt 
gagttacaca 
ggtatgctta 
gatatactta 
tegtatgcetg 
aatggtggtg 
teagtttttg 
atgacaacaa 
ttacatgcac 
tttgaccagg 
aagettacce 
gttacatatt 
ttacacccca 
tggectttaa 
aataatgage 
gcacattgta 
gctgctatta 
aatcagattt 
gttgaagttg 
ettggtgcta 
gatgctgtag 
aaatatgtgg 
aatggtagtg 
ggaggagett 


aatttagatg 


-continued 


ggttttggag 


ttacggcagt 
tctttatttc 
cattgattac 
ttagtcaagt 
taccatggat 
atatgaattc 
ttatttacac 
ttgagttggt 
ttgtttttaa 
atgtgttaat 
tgtattggtg 
cagtagatca 
aggtctttac 
ctacagtgca 
tgactaaget 
ataaaatcct 
taacactatt 
agaggtcaac 
aatatgaaag 
taacacagca 
atagagactt 
tgtataaaga 
tacttttttc 
cgaatagtgg 
ttgttatacc 
caacagttgt 
ettctacagg 
aggttaactt 
ttatgectca 
gegttgagtc 
ectcttcaaa 
atgtagactt 
tttacctgta 
tatctaatgt 
geattctcte 
cagcaggtaa 
gttttgcaat 
etgtgtgtct 


gacgctgtca 


tagatgtgta 
gcecacttaag 
ttttactgtt 
agttattatt 
tgttattttc 
gttattgcca 
tttcaatact 
ctcttcaaac 
tcacactatt 
gtgtgctaag 
ggcagtcatg 
ggttaataaa 
atataggtat 
tacaaatata 
atctaaattg 
taatgttgaa 
egeatctgat 
ttgtatagat 
tgtattacaa 
agctaagagt 
agagettget 
ggetgttcaa 
ggegegtgta 
aatgcttaag 
tgttgtacce 
agacccagag 
ttggaatata 
tagtggattg 
gactaggaat 
eggtgtaaag 
taaatgttat 
tectaatctg 
agacccacca 
ttttataaaa 
tgttgtgtta 
actttgttct 
tcaaccttta 
aacatcaaag 
ttattgtaga 


atttaaaggt 


9840 


9900 


9960 


10020 


10080 


10140 


10200 


10260 


10320 


10380 


10440 


10500 


10560 


10620 


10680 


10740 


10800 


10860 


10920 


10980 


11040 


11100 


11160 


11220 


11280 


11340 


11400 


11460 


11520 


11580 


11640 


11700 


11760 


11820 


11880 


11940 


12000 


12060 


12120 


12180 
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tettttgtge 
gtttgcactg 
ectaaacctt 
gggtacgggg 
ttgtaaageg 
agegtaactg 
gtgattctta 
gttatgaaga 
acatttataa 
etttgeggca 
gttgtataga 
tagaaaacce 
tattgaatge 
cacttgataa 
egectggtge 
ccatgactga 
aatcttatga 
actttaagta 
gtttgataca 
teggtaattt 
atcattctaa 
tgggtttgag 
ataaattagt 
ctcatcaaac 
etggtatgtt 
gtaatgetge 
tacgtcaact 
gctgtatacc 
tcaataagtt 
tetttgagag 
ccatatccge 
ctaataggca 
tagttattgg 
agggtgttga 
ctaatttgtt 
cttggtctga 
tettagctac 
ctgcttatge 


ttttgagtgt 


aaatacctac 
tttgtcagtg 
ctgttcagte 
tagcagtgag 
agectttgat 
tgcacgatte 
ttttgtggtt 
cttaaagtca 
tattagtagg 
etttgaccca 
agattatcac 
taaatattat 
tattgagtte 
ccaagatctt 
tggtgttecct 
tgegttggca 
tetecctcaag 
ttgggatcaa 
ttgtgcaaac 
gtgtagaaag 
ggaacttggt 
tcaactcatg 
ggatcttaga 
ggtaaaacca 
taaggaaggt 
tataaacgat 
tttattttgt 
agcaagcecaa 
tggaaaggeec 
tacaaagaag 
gaaaaataga 
gtttcatcag 
aacaaccaag 
agacecgatt 
gegtatagca 
acgegtttat 
aggtggtata 
aaacagtgtt 


tataacgegt 


63 


tacggagaaa 
ttggattggt 
agttgetgtt 
gcteggetga 
gtttgtaata 
caagaagtac 
aaacaaacca 
gaagtaacag 
cagaggctta 
aaggattgcg 
ectaagtggt 
gcecatgttgg 
ggaaacctca 
aatggcaaat 
gtttttgata 
ectgagaggt 
tatgattata 
gagtatcacc 
ttcaacatct 
gtttttgttg 
gttattatga 
cagtttgttg 
acgtcttgtt 
ggtcacttta 
tettctatac 
tatgattatt 
ttagaagtga 
gttgtagtta 
egtctctatt 
aacgtcctge 
gegegtacag 
aagattctta 
ttttatggeg 
ettatgggtt 
gcatctttag 
aggttgtata 
tatgtgaaac 


ttcaacataa 


gatattgtat 
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gatcctgttg 
tatggatgtc 
geatctggtt 
taccectage 
aggaatcage 
gtgatactga 
ctcctagtaa 
ctgatcatga 
ctaagtatac 
aagttcttaa 
ttgaagagaa 
ctaaaatggg 
tggttgaaaa 
tttatgattt 
egtattattc 
attttgaata 
etgaggagaa 
ctaactgtcg 
tgttttctac 
atggtgtacec 
atcaagataa 
gagatcctge 
ttagtgtttg 
acaaggattt 
cacttaaaca 
atcgttataa 
ettctaaata 
acaatttaga 
atgaaatgag 
ctactataac 
tggcaggtgt 
agtctatagt 
gttgggataa 
gggattatce 
tactegctcg 
atgaatgege 
ctggtggtac 
tacaagcecac 


atgatgacat 


-continued 


gattetgtct 
agtgtgatte 
ttgataagaa 
taatggatgt 
eggtatgttt 
agatggaaat 
ttatgaacat 
tttctttgtg 
tatgatggat 
agaaatactt 
taaggattgg 
acctattgta 
aggttatgtt 
tggtgatttt 
ttacatgatg 
tgatgtgcat 
acaagatttg 
egactgtagt 
acttgtaccg 
atttataget 
caccatgtca 
ettgttagtg 
tgctttageg 
ctacgatttt 
tttcttctac 
caggectace 
ttttgaatgt 
taagagtgca 
tctagaggag 
tcagatgaat 
gtctatectt 
caacactaga 
catgttgaga 
aaagtgtgat 
taaacacact 
tcaggtttta 
tagcagtgga 
atctgctaat 


taagagettg 


acgtaacaag 
acttagacaa 
ttatttaaac 
gacccegatg 
caaaatttga 
cttgagtatt 
gagaaagett 
ttcaataaga 
ttttgctatg 
gtcacttatg 
tacgacccaa 
egacgtgett 
ggtgttatta 
cagaagacag 
cccatcatag 
aagggttata 
tttcagaagt 
gatgacaggt 
cagacttctt 
acttgtggct 
ttttcaaaaa 
gggacatcca 
tetggtatta 
geagagaagg 
ccacagactg 
atgtttgata 
tatgaaggeg 
ggttatcegt 
caggaccaac 
ttaaaatatg 
tctactatga 
aacgctcctg 
aaccttattc 
agagcaatge 
aattgttgta 
tctgaaactg 
gatgctacta 
gttgegegtc 


cagtatgaat 


12240 


12300 


12360 


12420 


12480 


12540 


12600 


12660 


12720 


12780 


12840 


12900 


12960 


13020 


13080 


13140 


13200 


13260 


13320 


13380 


13440 


13500 


13560 


13620 


13680 


13740 


13800 


13860 


13920 


13980 


14040 


14100 


14160 


14220 


14280 


14340 


14400 


14460 


14520 
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tgtaccagca 
ettatttgtg 
acaacacatt 
actatcagaa 
aaggcccaca 
gatacttgec 
tggataagac 
acccactagt 
acatcagaaa 
tggatataga 
ecectacaac 
getgtggtaa 
tgcacacaga 
gtggtgaage 
ataaaccaaa 
ctaattgtge 
ctactgtgga 
cagagacagt 
gagaagtact 
cattgaatag 
teggtgattt 
ctactgctaa 
ttatagegcee 
atgtgatggt 
agaagegtac 
tggeggctta 
atgetttatg 
etcaaaggac 
acatttttag 
aggttagtat 
atgttgtgta 
cactctctce 
ttttecttge 
tatatgatgg 
ttaataatgg 
tagaatttgt 
caccttataa 
tagactegte 
agcatgceact 


tagttgtcat 


ggtttatagg 
taagaattte 
agccaaacaa 
caatgttttt 
tgaattttgt 
atatccagac 
agaatctgtg 
acatcatgaa 
actctatcaa 
taagggtagt 
attacagtct 
ttgtattecge 
ccacaaaaat 
agatgttact 
gttatcaata 
aggtagcgaa 
accttatatt 
aaaagctaca 
ctcagatcgt 
aaattatgtt 
tacatttgaa 
attgtctgtt 
aacgttgtgt 
acctgegtgt 
tacagtacaa 
etttagtaac 
tgaaaaaget 
tactatcgat 
tactattaat 
gttgaccaat 
tgtaggtgat 
aaaggattat 
aaagtgttac 
aaagtttatt 
taattctgat 
gaaagatttt 
tgctatgaac 
tcaaggttcg 


gaatattaac 


gegtcagegt 


65 


egagtcaatt 
tcattgatga 
ggtcttgtag 
atggctgatt 
tcacagcaca 
ccatcacgta 
getgttatgg 
aatgaggagt 
gagetttcte 
aaattttggg 
tgtggegttt 
aaaccatttt 
gttttgtcta 
aaattgtacc 
cegttagtat 
aatgttgatg 
ttggcaaatc 
gaagaattac 
gaattgatte 
ttceactggct 
aaaggtgaag 
ggagacattt 
ectcagcaaa 
tttgtaaata 
ggeectcctg 
gecegtgteg 
tttaagttte 
tgettctcta 
gecttgecag 
tacgaattgt 
ectgctcaat 
aatgttgtca 
egttgtccta 
gcaaataacc 
gtaggacatg 
gtctgtcgca 
cagagagect 
gagtatgatt 
agattcaatg 


gatgaactat 
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ttgacccage 
tettgtctga 
cagatatttc 
ctaaatgttg 
caatgttagt 
ttttgtgtge 
agegttatat 
acaagaaggt 
agaatatgcet 
aacaggagtt 
gtgtagtgtg 
tgtgttgtaa 
taaatcctta 
teggaggtat 
ctaatggtac 
attttaatca 
gttgtgtaga 
ataagcaaca 
tgtcttggga 
ttcactttac 
gtaaggacgt 
ttgttttaac 
ecettttctag 
acattccatt 
gcagtggtaa 
tttttactge 
ttaaagtaga 
agtttaaage 
aagttagttg 
ettttattaa 
taceggegec 
caaaccttat 
aagaaattgt 
eggaatcacg 
aaagtggctc 
ataaggaatg 
accgtatgcet 
atgttatctt 
tagegcttac 


attcagetct 


-continued 


atttgttgaa 
egacggtgtt 
tggttttaga 
ggttgaacca 
ggaggttgat 
atgtgttttt 
egetcttgec 
attctttgtg 
tatggactac 
ctatgaaaat 
taatagtcaa 
gtgttgctat 
catttgctca 
gtcatacttc 
agtgtttgga 
actagctact 
ttegttgaga 
atttgctagt 
gecaggtaaa 
tagaactagt 
tgtctattat 
ctcacacaat 
gtttgtgaat 
gtaccattta 
atcccatttt 
atgctctcat 
tgattgcact 
taatgacaca 
tgacattctt 
tggtaagata 
tegtacgttg 
ggtttgtgtt 
agatactgtt 
teagtgtttec 
agectacaac 
gegggaagea 
tggacttaat 
ttgtgttact 
aagagccaag 


taagtttata 


aagttttatt 
gtttgttata 
gaagttctct 
gatttagaaa 
ggtgagecta 
gtagatgatt 
atagatgegt 
ettctttcat 
tettttgtaa 
atgtatagag 
actatattge 
gaccatgtca 
cagecaggtt 
tgceggtaatc 
atttacaggg 
actaattggt 
egetttgctg 
gcagaagtga 
accaggectc 
aaagttcage 
egagegacgt 
gttgtttcte 
ttaagaccta 
gtaggcaage 
gctataggat 
geagetgttg 
egtatagtac 
ggcaaaaagt 
ttggttgacg 
aactatcaat 
ettaacggtt 
aaacctgaca 
tetactcttg 
aaggttatag 
ataactcaat 
acattcattt 
gttcagacag 
gcagattcge 
egtggtatac 


gagcttgata 


14580 


14640 


14700 


14760 


14820 


14880 


14940 


15000 


15060 


15120 


15180 


15240 


15300 


15360 


15420 


15480 


15540 


15600 


15660 


15720 


15780 


15840 


15900 


15960 


16020 


16080 


16140 


16200 


16260 


16320 


16380 


16440 


16500 


16560 


16620 


16680 


16740 


16800 


16860 


16920 
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gtgtagcaag 
ttcaccecage 
aacttgctge 
etttgttagg 
gtgatgagge 
ettgeggtac 
actttgtagt 
tgaattctaa 
ctaaaccttg 
acgtttcaga 
getattttgt 
ttaattctca 
ataatccact 
atgatttgca 
egegttgtct 
ctcatatage 
ttaatgcatg 
gtataaaatg 
tacccaatgt 
gtetttgtat 
ggtacgacac 
atgttaacaa 
aagctatgce 
ttgcgcaaga 
gtgctgtttg 
etgttactge 
aaagttttte 
attatgatge 
tagatcaagg 
egtttgagcet 
gtttgggtgt 
tataccgtaa 
tgctgtatga 
tagtttctac 
ttcagaacgg 
gtgegtttgt 
aacctcgtag 
agtatggtaa 


taggtggttt 


tetgcaaggt 
ttatgcagtc 
acttgttaac 
gtttaagatg 
tatccgcaac 
taacattggt 
tacgectgag 
agcacctcca 
geatgttgta 
ttgtgtagtg 
taaaataggc 
tactcaggct 
ettagtggat 
ttgtaatgtg 
tgcaattaat 
aaatgaggat 
tgttgatget 
tgttagacgt 
caageagttt 
gttttggaat 
acgaaatttg 
geatgeattec 
attcttttte 
ecttgtgtca 
taaaaagcac 
tggttttact 
agetctccag 
tattgcagga 
cgtagaaaaa 
gtatgcgaag 
agatgtgact 
tactgttaag 
tgatagatat 
acagtgttac 
tattcegtta 
tacgctacct 
tgatgttgag 
agaattaggt 


acacactgtt 
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acaggcttgt 
acaactaagg 
gtggaagcetg 
agtgttaatg 
gtaagaggtt 
actaacctge 
ggacttgtag 
ggtgaacaat 
aggccaagga 
tttgtcacgt 
aaggaccaag 
tatgettgtt 
attcaacagt 
catggacacg 
aatgceatttt 
gaagtcaatt 
cttaaagtta 
ggagacttaa 
gagtatgact 
tgtaatgtgg 
agtgtgttta 
cacacaccta 
tatgactcat 
ttagctacga 
gcacaaatgt 
ttttgggtta 
tetatcgaca 
gaaatgcecca 
geagtttttt 
agaaatattc 
aatggatttg 
gtatgtgcat 
ggtgattacc 
aageggtatt 
aaagatggag 
aacacattaa 
egtgatttte 
ctacagcaca 


ataggtatgt 
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ttaaaatttg 
etettgctge 
gttcagaaat 
ttgaaggcetg 
gggtaggttt 
ctttccaagt 
atacttcaat 
ttaatcactt 
ttgtgcaaat 
ggtgtcatgg 
tttgttcttg 
ggaagceattg 
ggggttattc 
cacatgtage 
gtcaagatgt 
ctagctgtag 
acgttgtcta 
attttagatt 
ataatcagca 
attgttatcc 
acctacctgg 
aatttgatcg 
egecttgcega 
aagattgtat 
atgcagattt 
ctaataattt 
atattgctta 
ctatcgtaac 
ttaatcaaac 
gceacactgcc 
taatttggga 
atacagacat 
agtettttct 
egtatgtaga 
egaacctgta 
acacacaggg 
tegacatgtec 
tactgtatgg 


geagactttt 


-continued 


caacaaagag 
aacttataaa 
aacatataaa 
ccacaacatg 
tgatgtagaa 
aggtttctct 
aggcaataat 
gagagegtta 
gttageggat 
ectagaacta 
eggttctaga 
ettgggtttt 
tggtaaccta 
ttetgceggat 
caactgggat 
atatttacaa 
tgatatagge 
ctatgataag 
caaagataag 
egacaattce 
ttgtaatggt 
cactagettt 
gaccattcaa 
cacaaaatge 
tgtgacttct 
taacccatat 
taatatgtat 
tggagataaa 
aattctgect 
aaacaaccgt 
ttacacgaac 
agaaccaaat 
agetgctgat 
aataccgtca 
tgtttataag 
tegeagttat 
tgaggagagt 
tgaagttgat 


acgtgcgaat 


tttagtggtg 
gttaatgatg 
catcttattt 
tttataacac 
gcaacacatg 
actggtgcag 
tttgagectg 
ttcaaaagtg 
aacctgtgca 
accactttge 
gcaacaactt 
gattttgttt 
caatttaacc 
gctattatga 
ttaacttacc 
egeatgtatc 
aaccctaaag 
aatccaatag 
tttgctgatg 
ttagtttgta 
ggtagettgt 
cgtaatttga 
ttggatggag 
aacataggcg 
tataatgcag 
aatttgtgga 
aagggtggtc 
gtttttgtta 
acatctgtag 
attttgaaag 
caaacaccac 
ggectaatag 
aatgetgttt 
aacctgcttg 
egtgttaatg 
gaaacttttg 
tttgtagaaa 
aagecccaat 


aagttgaacg 


16980 


17040 


17100 


17160 


17220 


17280 


17340 


17400 


17460 


17520 


17580 


17640 


17700 


17760 


17820 


17880 


17940 


18000 


18060 


18120 


18180 


18240 


18300 


18360 


18420 


18480 


18540 


18600 


18660 


18720 


18780 


18840 


18900 


18960 


19020 


19080 


19140 


19200 


19260 
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caaagtctgt 
atggttccta 
ttcttaggaa 
ttgattacca 
atccacagct 
agaattgtgt 
gtggtattat 
caatgtgtgt 
tggetccagg 
atgatattgt 
ataagacaga 
aaagaaagca 
caagttttct 
caagttggca 
gtacagcagt 
caagtgaaaa 
attgtaatta 
gattgaaage 
atttaattaa 
actettttgt 
taccaaagtg 
gttaatattt 
attcatggtg 
atggecttggt 
tttgttacac 
tttttacgtg 
gctaagtacec 
aatggtgate 
tttaaagetg 
tttgttaatg 
geatgecagt 
ttagttaage 
cacaatttca 
attcaaactt 
etgagtagtt 
aattttagac 
gcettacggtc 
tgttatgett 
gatcttaatt 


caaacagcca 


tactaattct 
caagcaagtg 
catactgaaa 
tagcataaat 
tcaatcagca 
tatggaacct 
gatgaatgtg 
accgcataat 
tagtactgtt 
agactatgtg 
gcacaagttt 
tgaaggegtg 
tegtaataat 
cgaagtttta 
gaatgectct 
ggttaaggtt 
tttacaaacc 
aacaccagtt 
gtgtggtaag 
gtgtactatg 
ectttagace 
ctagcegaate 
gtegtgttgt 
ctagcagtca 
attgttataa 
tttctgctat 
ctacttttaa 
ttgtttacac 
gtggacctat 
gtactgcaca 
ataatactgg 
agaagtttat 
ettttcataa 
accaaacaca 
ttgtttataa 
tagaaactat 
etcttcaagg 
attcatatgg 
ttgaatgtgg 


ctgaaccegec 
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gattctgatg 
tgtactgttg 
gagtatggta 
tttatgactt 
tggacgtgtg 
tgcaacattc 
gcaaagtata 
atgcgagtaa 
cttaaacaat 
tetgatgcac 
gatcttgtga 
atagccaata 
ttggctctag 
tatgacattg 
tettcagaag 
agtggaaaaa 
tetgettata 
gttaatttga 
ttactggtaa 
tagtgetget 
acctaatggt 
taataatgca 
taatgettct 
gttttgtact 
atatgatggg 
gaaaaatgge 
atcatttcag 
ctctaatgag 
aacttataaa 
agatgttatt 
caatttttca 
tgtctategt 
tgagactgge 
aacagctcag 
ggagtctaat 
taataatgge 
tggttgcaag 
aggtccttcg 
actgttagtt 


agttataact 
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tcatgcaaaa 
tggatttget 
ctaataagtc 
ggtttgaaga 
gttataatat 
ctaattatgg 
cacaactctg 
tgcattttgg 
ggctcccaga 
atgtttctgt 
tatctgatat 
atggcaatga 
gtggtagttt 
cacaggattg 
cattecttggt 
egetgcacge 
gtatatttga 
aaactgaaca 
gagatgttgg 
ttgtatgaca 
tggcatttac 
ggetcttcac 
tctatagcta 
gcacactgta 
tgtcctataa 
cagettttct 
tgtgttaata 
accacagatg 
gttatgagag 
ttgtgtgatg 
gatggctttt 
gaaaatagtg 
gecaacccta 
agtggttatt 
tttatgtatg 
ttgtggttta 
caatctgtct 
ctgtgtaaag 
tatgttacta 


cgacacaatt 


-continued 


ttattttgta 
gettgatgat 
taaagttgta 
tggcattatt 
gectgaactt 
tgttggaata 
teaatacctt 
agctggaagt 
agggacactc 
getttcagat 
gtatacagac 
tgacgtttte 
tgctgtaaaa 
tgcatggtgg 
tggtgttaat 
aaattatata 
egttgctaag 
aaagacagac 
taacacctct 
gtagttctta 
acgggggtge 
etgggtgtat 
tgacggcace 
acttttcaga 
etggeatget 
ataatttaac 
atttaacatc 
ttacatctge 
aagttaaage 
gatcacctag 
atccttttat 
ttaatactac 
atcctagtgg 
ataattttaa 
gatcttatca 
attcactttc 
ttagtggtag 
gtgtttattc 
agagcggtgg 


ataataatat 


ttggcagaca 
ttcttagaac 
acagtgtcaa 
aaaacatgtt 
tataaagttc 
gegttgccaa 
tcegaaaacaa 
gacaaaggag 
ettgtcgata 
tgcaataaat 
aatgattcaa 
atatatctct 
gtgacagaga 
acaatgtttt 
tatttgggtg 
ttttggagga 
tttgatttga 
ttagtcttta 
tttactagtg 
egtttactac 
ttatgeggta 
tgttggtact 
gtcatcaggt 
tactacagtg 
tcaaaagaat 
agttagtgta 
egtatattta 
aggtgtttat 
ectggcttat 
aggcttgtta 
taatagtagt 
ttttacgtta 
tgttcagaat 
tttttccttt 
eccaagttgt 
agtttcaatt 
agcaacttgt 
aggtgagtta 
ctctcgtata 


tactttaaat 


19320 


19380 


19440 


19500 


19560 


19620 


19680 


19740 


19800 


19860 


19920 


19980 


20040 


20100 


20160 


20220 


20280 


20340 


20400 


20460 


20520 


20580 


20640 


20700 


20760 


20820 


20880 


20940 


21000 


21060 


21120 


21180 


21240 


21300 


21360 


21420 


21480 


21540 


21600 


21660 
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acttgtgttg 
gactcagetg 
ggttccatag 
ecttgcgaag 
acttcacgta 
aatggaacac 
agttatggta 
ttggaacagt 
tttaatttaa 
tgtctgcagt 
ecetgtttgtg 
ettttgaatt 
gttagcactg 
egttctttta 
gacgcataca 
cgtgaatata 
tatactagtt 
ecttttgceca 
ttgaagaatec 
ggttttagaa 
getattctta 
attcaagaaa 
ataactggta 
agagtgtcac 
tctattaggt 
ectaatggta 
gcaatagtgg 
aatggtaggg 
tatatgccaa 
tatgtaagtg 
aatgacgaat 
ttcaattaca 
atacagggtc 
attaagtgge 
atactaggat 
attatgecte 
gatgtggtaa 
ecttcctaat 


ttattatage 


attataatat 
ttagttataa 
acatetttgt 
atgtcaacca 
atgagactgg 
gtegttttag 
agttttgtat 
ttgtggcace 
ctgttacaga 
atgtttgtgg 
acaacatatt 
tctattcttc 
gtgagtttaa 
ttgaagacct 
aaaattgcac 
atggtttget 
ctctagtage 
cacaactgca 
aagaaaaaat 
gtacatctct 
etgagactat 
tetaccagca 
gattgtcate 
aacagegtga 
actccttttg 
tagtgtttat 
gtttttgtgt 
gtatttttat 
gagctattac 
taaataagac 
tgtcaaaatg 
cagtacctat 
ttaatgactc 
ettggtatgt 
gggttttctt 
taatgagtaa 
cttaacaata 
agtattaatt 


gctccaacaa 
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atatggcaga 
ttatctagca 
tgtacaaggt 
gcagtttgta 
ttctcagett 
acgttctatt 
aaaacctgat 
tttacttaat 
tgagtacata 
caattctctg 
gtctgtagta 
tactaaaccg 
tatttctctt 
tctatttaca 
tgcaggacct 
tgtgttgect 
ttetatgget 
ggctagaatt 
tgetgettce 
agcattacaa 
ggcatcactt 
acttgacgcc 
actttctgtt 
gttagetact 
tggtaatgga 
acacttttct 
aaagecagct 
acaagttaat 
tgcaggagat 
egtcattact 
gtggaatgac 
acttgacatt 
tttaatagac 
gtggttagec 
catgactgga 
gtgtggtaag 
cagacctaaa 
tttetttggt 


ctaatacaag 
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actggccaag 
gacgcaggtt 
gaatatggtc 
gtttectggtg 
cttgagaace 
actgaaaatg 
ggttcaattg 
gttactgaaa 
caaacgegta 
gattgtagag 
aatagtattg 
getggtttta 
ctgttaacaa 
agegttgaat 
ttaggttttec 
cccattataa 
tttggtggta 
aatcacttgg 
tttaataagg 
caaattcaag 
aataaaaatt 
atacaagcaa 
ttagceatctg 
cagaaaatta 
egacatgttec 
tatactccag 
aatgctagtc 
ggtagttact 
atagttacge 
acattcgtag 
actaagcatg 
gatagtgaaa 
cttgaaaaac 
atagettttg 
tgttgtggtt 
aaatcttctt 
aagtctgttt 
gtaaacttgt 


ttttactcca 


-continued 


gttttattac 
tggetatttt 
ttacttatta 
gtaaattagt 
agttttacat 
ttgcaaattg 
ccacaatagt 
atgtgctcat 
tggataaggt 
atttgtttca 
gtcaaaaaga 
atacaccatt 
ctcctagtag 
etgttggatt 
ttaaggacct 
cagcagaaat 
ttactgcage 
gtattaccca 
cceattggtcg 
atgttgttaa 
ttggtgetat 
atgctcaagt 
ctaagcagge 
atgagtgtgt 
taaccatacc 
atagttttgt 
agtatgcaat 
acatcacage 
ttacttcttg 
acaatgatga 
agctaccaga 
ttgategtat 
tttcaatact 
ccactattat 
gttgttgtgg 
attacacgac 
aatgattcaa 
actaagttgt 


aattatcaat 


taatgtaacc 
agatacatct 
taaggttaac 
aggtattctt 
taaaatcact 
cecttatgtt 
accaaaacaa 
acctaacagt 
ccaaattaat 
acaatatggg 
agatatggaa 
tettagtaat 
tectagaagg 
accaacagat 
tgegtgtget 
gcaaattttg 
tggtgctata 
gtcacttttg 
tatgcaggaa 
taagcagagt 
ttcttctatg 
ggategtctt 
ggageatatt 
taagtcacag 
gcaaaatgea 
taatgttact 
agtaccegcet 
acgagatatg 
tcaagcaaat 
ttttgatttt 
ctttgacaaa 
teaaggegtt 
caaaacttat 
cttcatctta 
atgctttgge 
ttttgataac 
agtcccacgt 
tttagagagt 


agtaacttac 


21720 


21780 


21840 


21900 


21960 


22020 


22080 


22140 


22200 


22260 


22320 


22380 


22440 


22500 


22560 


22620 


22680 


22740 


22800 


22860 


22920 


22980 


23040 


23100 


23160 


23220 


23280 


23340 


23400 


23460 


23520 


23580 


23640 


23700 


23760 


23820 


23880 


23940 


24000 


q2 


agectagact 
aactggtgag 
aaacacagaa 
aatagagtca 
aaaatggaag 
taggtagage 
catgggtagt 
gaaaacttaa 
ggaataataa 
tttgaacagt 
ttcttaacca 
aaaatgatag 
atatacccac 
etgtettttg 
tggtcattta 
tgtaattttg 
etttattgtg 
tttgtttgta 
caaagcggaa 
ggegagcetag 
agagagtatt 
atttttgagg 
tttaaaaaac 
tttcaagtag 
tgtaggttgt 
aggegtttta 
taagggcagt 
ggttaatctt 
ttcaattaga 
tagtaaagat 
aggattagat 
tacctctcta 
gtcatggcaa 


ggaggaccaa 


aaagccaaga 


gaaaatctaa 


ggtaaaggtg 


ccageegcta 


ggtgctgata 


tatccegctac 


gacectttgt 
caagtgattec 
gtatttgace 
gctgaagatt 
ttttctaaca 
acttcaagca 
aattccagga 
caatccggaa 
aaatccagca 
cagttgaget 
taatacttca 
tgttatggtg 
caaacacagg 
taggttattg 
acccagaatc 
ctatagagag 
agggtcagtg 
caceggatag 
ataagaaacg 
aaagtgtage 
taaaattatt 
atattaatat 
agtttttcca 
ataatggaaa 
ggttgagtta 
tettacaage 
tatttcatgt 
agatcatgga 
tttagtttat 
aatccttttt 
tgtgtttact 
gtattccagg 
geggtaagge 
agecacctaa 
agttaaattc 
aaccaagtca 
gaagaaaace 
acctgaattg 
ctaaatttag 


ggttttcaga 
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cacagtctag 
aaaaaatcag 
ectttgacta 
gttcaggtga 
gegetttata 
tttgtacagg 
gctaagggta 
ttagaagcag 
aattttcaag 
ttttaaagag 
gtatggctat 
ettttggece 
aggtcttgtc 
gatccagagt 
taatgecgta 
tgtgecaatg 
gettgctaag 
acgtaatatc 
gtttgctacg 
aacaggaggg 
etttaatagt 
aaatcctctc 
etettttgtg 
agtctactac 
taaaaaagat 
gcttaataaa 
tataaacccc 
ccaaaacaca 
aggttggcgt 
geggagcaat 
ttcttaacaa 
ggaaaacttg 
aactggaaag 
agttggttct 
acctcegect 
gcagcatgga 
agtcccagat 
gggtgatage 


atctaatcag 


eggaggacct 
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actaatgtta 
tttcaattta 
ttgttattac 
tgatgaattt 
tatttgtagg 
etgctgatge 
cagectttgt 
ttattgtcaa 
atgtccaacg 
tataatttat 
gcaacaagaa 
cttaacattg 
gcagegataa 
attagactct 
ggttcaatac 
gtgetttcte 
tgtgaaccag 
taccgtatgg 
tttgtctatg 
agtagtcttt 
gectctattt 
tgttttatac 
ccaaaaacta 
gaaggaaaac 
taaactacct 
tacggacgat 
tattattaac 
tettaacgtg 
atacgcectac 
agcaagaaaa 
agcaggacaa 
tgaggaacac 
acagatgcce 
tctggaaatg 
aagtttgaag 
tattggagac 
gettggtatt 


caagatggta 


ggtactcgtg 


gatggtaatt 


-continued 


aacttagaag 
cagcatattt 
agaggaggta 
attgaataag 
atttttagca 
ttgttgttta 
atataagtat 
egagtttcct 
agacaaattg 
ttataactge 
gtaagtttat 
cagtaggtgt 
tacttacagt 
ttaageggtg 
tectaactaa 
caattataaa 
accacttgce 
tgcagaaata 
caaagcagtc 
acacctaaat 
taagagcegca 
tetcttttca 
ttgttgttaa 
caatttttca 
actacactta 
gaaatggctg 
tcaattaaga 
tgttaggtge 
tcaategctg 
gegegaattt 
gcagagtctt 
aaatataata 
cagctccagt 
tatettggtt 
gtageggtgt 
gccaagctag 
tttactatac 
tagtgtgggt 
actctgacaa 


teegttggga 


caattattga 
caagtgtatt 
atttttggga 
tegctagagg 
ctttatcttc 
ttttggtata 
acatatggta 
aagaacggtt 
tactcttgac 
attcttgttg 
ttatatactg 
aatttcatgt 
gtttgegtgt 
taggtcatgg 
tggtcaacaa 
gaatggtgtt 
taaagatata 
tactggtgac 
agtagatact 
gtgtgtgtgt 
taatagtatt 
agagctatta 
tggtgtaacc 
gaaaggttgt 
tttttataag 
actagttttg 
gtattagata 
gtgattttgt 
gtatgaataa 
atctgagaga 
gtecegegtg 
ataatctttt 
catcaaacta 
tcaagcaata 
tectgataat 


gtttaageca 


tggaacagga 


tgctggtaag 


gtttgaccaa 


tttcattcct 


24060 


24120 


24180 


24240 


24300 


24360 


24420 


24480 


24540 


24600 


24660 


24720 


24780 


24840 


24900 


24960 


25020 


25080 


25140 


25200 


25260 


25320 


25380 


25440 


25500 


25560 


25620 


25680 


25740 


25800 


25860 


25920 


25980 


26040 


26100 


26160 


26220 


26280 


26340 


26400 
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ctgaatcgtg gcaggagtgg gagatcaaca gcagcttcat 
ecatcacgtg aagtttcgeg tggtcgcagg agtggttctg 
gcagcaagga taattcagga tcagcagaag aagggttctc 
gatgaaatgg ctcaccgecg gtattgcaag cgcactattec 
caagtgtttg gtccccgtac taaaggtaag gagggaaatt 
gaggaaggta ttaaggatgg gcgegttaca gcaatgctca 
gettgtcttt teggaagtag agtgacgece agacttcaac 
tttgaattta ctactgtggt cccacgtgat gatccgcagt 
tgtgatcagt gtgttgatgg tgtaggaaca cgtccaaaag 
teacgctcaa gttcaagacc tgcaacaaga ggaaattcte 
ectaagaagg agaaaaagcc aaagaagcag gatgatgaag 
gatgaggaga ggaacaatge acagctggaa tttgatgatg 
ggggattcag ccctaggaga gaatgaactt tgagtaaaat 
agataggcat gtagcttgat tacctacatg tctatcgcca 
acttagtage ctggaaacga acggtagacc cttagatttt 
ttagtttaag ttagtttaga gtaggtataa agatgccagt 
gacecgagggt acagcactag gacgcccatt aggggaagag 
agtttaattg gctatgtata gttaaaattt ataggctagt 
aaaaaaaaaa aaaaaaaaaa 

<210> SEQ ID NO 2 

<211> LENGTH: 435 

<212> TYPE: DNA 

<213> ORGANISM: Infectious bronchitis virus 
<400> SEQUENCE: 2 

tctaaaggtc atgagacaga ggaagtggat gctgtaggca 
gcagtagatce ctgcggatac atattgtaaa tatgtggcag 
aactgtgtta aaatgttgac agtacataat ggtagtggtt 
agtccaactc cggatcagga ttcttatgga ggagcttctg 
catatagcac accttggcgg agcaggaaat ttagatggac 
tttgtgcaaa tacctactac ggagaaagat cctgttggat 
tgcactgttt gtcagtgttg gattggttat ggatgtcagt 
aaaccttctg ttcag 

<210> SEQ ID NO 3 

<211> LENGTH: 1563 


<212> TYPE: DNA 
<213> ORGANISM: Infectious bronchitis virus 


<400> SEQUENCE: 3 

ggtacaggct tgtttaaaat ttgcaacaaa gagtttagtg 
gtcacaacta aggctcttge tgcaacttat aaagttaatg 
aacgtggaag ctggttcaga aataacatat aaacatctta 


atgagtgtta atgttgaagg ctgccacaac atgtttataa 


-continued 


cagcageatc 
aagatgatct 
gcattacaaa 
cacctaatta 
ttggtgatga 
acctagttce 
cagatgggcet 
ttgataatta 
atgatgaacc 
cagegcecaag 
tggataaage 
aacccaaggt 
tcaatagtaa 
gggaaatgtc 
aatttagttt 
geceggggeca 
ctaaatttta 


atagagttag 


ttcetctcact 
caggtaatca 
ttgcaataac 
tgtgtcttta 
gctgtcaatt 
tetgtctacg 


gtgattcact 


gtgttcacce 


atgaacttge 


tttctttgtt 


cacgtgatga 


tagtagagca 
tattgetegt 
ggctaaggcet 
taaggttgat 
caagatgaat 
tagcagecat 
gcacttgaaa 
tgtaaaaatt 
aagaccaaag 
acagcagege 
attgacctca 
aattaactgg 
gagttaagga 
taatttgtct 
aatttttagt 
egeggagtac 
gtttaagtta 


agcaaaaaaa 


ttgttetttt 
acctttaggt 
atcaaageca 
ttgtagagca 
taaaggttct 
taacaaggtt 


tagacaacct 


agettatgca 


tgeacttgtt 


agggtttaag 


ggctatccege 


26460 


26520 


26580 


26640 


26700 


26760 


26820 


26880 


26940 


27000 


27060 


27120 


27180 


27240 


27300 


27360 


27420 


27480 


27500 


60 


120 


180 


240 


300 


360 


420 


435 


60 


120 


180 


240 


76 


aacgtaagag 
ggtactaace 
gagggacttg 
ecaggtgaac 
gtaaggccaa 
gtgtttgtca 
ggcaaggace 
gettatgett 
gatattcaac 
gtgcatggac 
aataatgcat 
gatgaagtca 
gctcttaaag 
egtggagact 
tttgagtatg 
aattgtaatg 
ttgagtgtgt 
ttccacacac 
ttctatgact 
tcattagcta 
cacgcacaaa 
actttttggg 


cag 


gttgggtagg 


tgectttcca 
tagatacttc 
aatttaatca 
ggattgtgca 
egtggtgtca 
aagtttgtte 
gttggaagca 
agtggggtta 
acgcacatgt 
tttgtcaaga 
attctagetg 
ttaacgttgt 
taaattttag 
actataatca 
tggattgtta 
ttaacctace 
ctaaatttga 
catcgecttg 
cgaaagattg 
tgtatgcaga 


ttactaataa 


<210> SEQ ID NO 4 
<211> LENGTH: 1014 


<212> TYPE: 
<213> ORGANISM: 


DNA 


<400> SEQUENCE: 4 


tetatcgaca 


gaaatgcecca 


geagtttttt 


agaaatattc 


aatggatttg 


gtatgtgceat 


ggtgattacec 


aageggtatt 


aaagatggag 


aacacaataa 


egtgatttte 


ctacagcaca 


ataggtatgt 


atattgcetta 


ctatcgtaac 


ttaatcaaac 


geacactgec 


taatttggga 


atacagacat 


agtettttct 


egtatgtaga 


cgaacctgta 


acacacaggg 


tegacatgtc 


tactgtatgg 


geagactttt 


77 


ttttgatgta 
agtaggtttc 
aataggcaat 
ettgagageg 
aatgttagcg 
tggectagaa 
ttgeggttct 
ttgettgggt 
ttctggtaac 
agettctgcg 
tgtcaactgg 
tagatattta 
ctatgatata 
attctatgat 
gcacaaagat 
teccgacaat 
tggttgtaat 
tegcactage 
egagaccatt 
tatcacaaaa 
ttttgtgact 


ttttaaccca 


taatatgtat 
tggagataaa 
aattctgect 
aaacaaccgt 
ttacacgaac 


agaaccaaat 
agetgctgat 
aataccgtca 
tgtttataag 
tegaagttat 
tgaggagagt 
tgaagttgat 


acgtgcgaat 
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gaagceaacac 
tetactggtg 
aattttgage 
ttattcaaaa 
gataacctgt 
ctaaccactt 
agagcaacaa 
tttgattttg 
ctacaattta 
gatgctatta 
gatttaactt 
caacgeatgt 
ggcaacccta 
aagaatccaa 
aagtttgctg 
tecttacttt 
ggtggtaget 
tttcgtaatt 
caattggatg 
tgcaacatag 
tettataatg 


tataatttgt 


Infectious bronchitis virus 


aagggtggtc 


gtttttgtta 


acatctgtag 


attttgaaag 


caaacaccac 


ggcctaatag 


aatgetgttt 


aacctgcttg 


egtgttaatg 


gaaacttttg 


tttgtagaaa 


aagecccaat 


aagttgaacg 


-continued 


atgcttgcegg 
cagactttgt 
ctgtgaattc 
gtgctaaace 
gcaacgtttc 
tgegetattt 
cttttaattc 
tttataatcc 
accatgattt 
tgacgegttg 
accctcatat 
atcttaatge 
aaggtattaa 
tagtacccaa 
atggtctttg 
gtaggtacga 
tgtatgttaa 
tgaaagctat 
gagttgcegca 
geggtgetgt 
cagctgttac 


ggaaaagttt 


attatgatge 
tagatcaagg 
egtttgaget 
gtttgggtgt 
tataccgtaa 
tgctgtatga 
tagtttctac 
ttcagaacgg 
gtgegtttgt 
aacctcgtag 
agtatggtaa 
taggtggttt 


caaagtctgt 


tactaacatt 
agttacgect 
taaageacct 
ttggeatgtt 
agattgtgta 
tgttaaaata 
tcatactcag 
actcttagtg 
gceattgtaat 
tettgcaatt 
agcaaatgag 
atgtgttgat 
atgtgttaga 
tgtcaagcag 
tatgttttgg 
cacacgaaat 
caagcatgca 
gecattettt 
agaccttgtg 
ttgtaaaaag 
tgetggtttt 


tteagetcte 


tattgcagga 
cgtagaaaaa 
gtatgcegaag 
agatgtgact 
tactgttaag 
tgatagatat 
acagtgttac 
tattcegtta 
tacgctacct 
tgatgttgag 
agaattaggt 
ccacactgtt 


tactaattct 


300 


360 


420 


480 


540 


600 


660 


720 


780 


840 


900 


960 


1020 


1080 


1140 


1200 


1260 


1320 


1380 


1440 


1500 


1560 


1563 


60 


120 


180 


240 


300 


360 


420 


480 


540 


600 


660 


720 


780 


78 


gattctgatg 


tgtactgttg 


gagtatggta 


tttatgactt 


tcatgcaaaa 


tggatttget 


ctaataagtc 


ggtttgaaga 


<210> SEQ ID NO 5 
<211> LENGTH: 909 


<212> TYPE: 
<213> ORGANISM: 


DNA 


<400> SEQUENCE: 5 


tcagcatgga 
gaaccttgca 
aatgtggcaa 
cataatatge 
actgttctta 
tatgtgtctg 
aagtttgatc 
ggcegtgatag 
aataatttgg 
gttttatatg 
geetecttett 
aaggttagtg 
caaacctctg 
ecagttgtta 
ggtaagttac 


actatgtag 


egtgtggtta 
acattcctaa 
agtatacaca 
gagtaatgca 
aacaatggct 
atgcacatgt 
ttgtgatate 
ccaataatgg 
etetaggtgg 
acattgcaca 
cagaagcatt 
gaaaaacget 
ettatagtat 
atttgaaaac 


tggtaagaga 


<210> SEQ ID NO 6 
<211> LENGTH: 145 


<212> TYPE 


<213> ORGANISM: 


PRT 


<400> SEQUENCE: 6 


79 


ttattttgta 
gettgatgat 
taaagttgta 


tggcattatt 


taatatgect 
ttatggtgtt 
actctgtcaa 
ttttggaget 
cecagaaggg 
ttetgtgett 
tgatatgtat 
caatgatgac 
tagttttget 
ggattgtgca 
ettgattggt 
gcacgcaaat 
atttgacgtt 
tgaacaaaag 


tgttggtaac 


Ser Lys Gly His Glu Thr Glu Glu 


Leu Cys Ser 


Phe Ala Val Asp Pro 


20 


Ala Ala Gly Asn Gln Pro Leu Gly 


35 


40 


His Asn Gly Ser Gly Phe Ala Ile 


50 


Asp Gln Asp 


65 


His Ile Ala 


55 


Ser Tyr Gly Gly Ala 


70 


His Pro Gly Gly Ala 


85 


Phe Lys Gly Ser Phe Val Gln Ile 


Gly Phe Cys 


100 


Leu Arg Asn Lys Val 
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ttggcagaca 


ttcttagaac 


acagtgtcaa 


aaaacatgtt 


Infectious bronchitis virus 


gaactttata 
ggaatagegt 
tacctttcga 
ggaagtgaca 
acactccttg 
tcagattgca 
acagacaatg 
gttttcatat 
gtaaaagtga 
tggtggacaa 
gttaattatt 
tatatatttt 
gctaagtttg 
acagacttag 


acctctttta 


Infectious bronchitis virus 


Val Asp Ala 
10 


Ala Asp Thr 
25 


Asn Cys Val 


Thr Ser Lys 


Ser Val Cys 
75 


Gly Asn Leu 
90 


Pro Thr Thr 
105 


Cys Thr Val 


-continued 


atggttccta 
ttcttaggaa 
ttgattacca 


atccacagcet 


aagttcagaa 


tgccaagtgg 
aaacaacaat 
aaggagtggt 
tegataatga 
ataaatataa 
attcaaaaag 
atctctcaag 
cagagacaag 
tgttttgtac 
tgggtgcaag 
ggaggaattg 
atttgagatt 
tctttaattt 


ctagtgactec 


caagcaagtg 
catactgaaa 
tagcataaat 


tcaa 


ttgtgttatg 
tattatgatg 
gtgtgtacceg 
gccaggtagt 
tattgtagac 
gacagagcac 
aaagcatgaa 
ttttcttcgt 
ttggcacgaa 
agcagtgaat 
tgaaaaggtt 
taattattta 
gaaagcaacg 
aattaagtgt 


ttttgtgtgt 


Val Gly Ile Leu Ser 


Tyr Cys Lys 


30 


15 


Tyr Val 


Lys Met Leu Thr Val 


45 


Pro Ser Pro Thr Pro 


60 


Leu Tyr Cys 


Arg Ala 
80 


Asp Gly Arg Cys Gln 


95 


Glu Lys Asp Pro Val 
110 


Cys Gln Cys 


Trp Ile 


840 


900 


960 


1014 


60 


120 


180 


240 


300 


360 


420 


480 


540 


600 


660 


720 


780 


840 


900 


909 


80 


115 


81 


120 
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-continued 


125 


Gly Tyr Gly Cys Gln Cys Asp Ser Leu Arg Gln Pro Lys Pro Ser Val 


Gln 
145 


130 


<210> SEQ ID NO 7 


<211> LENGTH: 


<212> TYPE: 
<213> ORGANISM: 


PRT 


<400> SEQUENCE: 


Gly 


AL 


Asn 


Thr 


val 


65 


Asn 


Gly 


Gly 


Gly 


Phe 


145 


val 


ser 


Thr 


Gly 


Trp 


225 


Asp 


Leu 


Ile 


Asn 


ser 


305 


Ala 


Thr 


Ala 


Asp 


Tyr 


50 


Glu 


Val 


Thr 


Ala 


Asn 


130 


Asn 


Arg 


Asp 


Leu 


ser 


210 


Lys 


Ile 


His 


Met 


Trp 
290 


Ser 


Leu 


521 


135 


140 


Infectious bronchitis virus 


7 


Gly Leu Phe 


Tyr 


Glu 


35 


Lys 


Gly 


Arg 


Asn 


Asp 


115 


Asn 


His 


Pro 


Cys 


Arg 


195 


Arg 


His 


Gln 


Cys 


Thr 
275 
Asp 


Cys 


Lys 


Ala 


20 


Leu 


His 


Cys 


Gly 


Ile 


100 


Phe 


Phe 


Leu 


Arg 


Val 


180 


Tyr 


Ala 


Cys 


Gln 


Asn 


260 


Arg 


Leu 


Arg 


val 


5 


Ala 


Leu 


His 


Trp 


85 


Gly 


val 


Glu 


Arg 


Ile 


165 


val 


Phe 


Thr 


Leu 


Trp 


245 


val 


Cys 


Thr 


Tyr 


Asn 
325 


Lys 


Thr 


Ala 


Ile 


Asn 


70 


val 


Thr 


val 


Pro 


Ala 


150 


val 


Phe 


val 


Thr 


Gly 


230 


Gly 


His 


Leu 


Tyr 


Leu 
310 


Val 


Ile 


Thr 


Leu 


ser 


55 


Met 


Gly 


Asn 


Thr 


val 


135 


Leu 


Gln 


val 


Lys 


Phe 


215 


Phe 


Tyr 


Gly 


Ala 


Pro 
295 


Gln 


Val 


Cys 


Lys 


Val 


40 


Leu 


Phe 


Phe 


Leu 


Pro 


120 


Asn 


Phe 


Met 


Thr 


Ile 


200 


Asn 


Asp 


Ser 


His 


Ile 


280 


His 


Arg 


Tyr 


Asn 


Ala 


25 


Asn 


Leu 


Ile 


Asp 


Pro 


105 


Glu 


Ser 


Lys 


Leu 


Trp 


185 


Gly 


Ser 


Phe 


Gly 


Ala 


265 


Asn 


Ile 


Met 


Asp 


Lys Glu Phe 
10 


Leu Ala Ala 


Val Glu Ala 


Gly Phe Lys 


60 


Thr Arg Asp 
75 


Val Glu Ala 
90 


Phe Gln Val 


Gly Leu Val 


Lys Ala Pro 


140 


Ser Ala Lys 
155 


Ala Asp Asn 
170 


Cys His Gly 


Lys Asp Gln 


His Thr Gln 


220 


Val Tyr Asn 
235 


Asn Leu Gln 
250 


His Val Ala 


Asn Ala Phe 


Ala Asn Glu 


300 


Tyr Leu Asn 
315 


Ile Gly Asn 
330 


Ser 


Thr 


Gly 


45 


Met 


Glu 


Thr 


Gly 


Asp 


125 


Pro 


Pro 


Leu 


Leu 


Val 


205 


Ala 


Pro 


Phe 


Ser 


Cys 


285 


Asp 


Ala 


Pro 


Gly 


Tyr 


30 


Ser 


Ser 


Ala 


His 


Phe 


110 


Thr 


Gly 


Trp 


Cys 


Glu 
190 


Cys 


Tyr 


Leu 


Asn 


Ala 


270 


Glin 


Glu 


Cys 


Lys 


Val 


15 


Lys 


Glu 


val 


Ile 


Ala 


95 


Ser 


Ser 


Glu 


His 


Asn 


175 


Leu 


Ser 


Ala 


Leu 


His 


255 


Asp 


Asp 


Val 


Val 


Gly 
335 


His 


Val 


Ile 


Asn 


Arg 


80 


Cys 


Thr 


Ile 


Gln 


val 


160 


val 


Thr 


Cys 


Cys 


Val 


240 


Asp 


Ala 


Val 


Asn 


Asp 


320 


Ile 


82 


Lys 


Pro 


Lys 


Asp 


385 


Leu 


Asn 


Asn 


Thr 


Lys 


465 


His 


Thr 


Leu 


Cys 


Ile 


Asp 


370 


Cys 


Ser 


Lys 


Leu 


Ile 


450 


Asp 


Ala 


Ala 


Trp 


val 


val 


355 


Lys 


Tyr 


val 


His 


Lys 


435 


Gln 


Cys 


Gln 


Gly 


Lys 
515 


Arg 


340 


Pro 


Phe 


Pro 


Phe 


Ala 


420 


Ala 


Leu 


Ile 


Met 


Phe 


500 


Ser 


Arg 


Asn 


Ala 


Asp 


Asn 


405 


Phe 


Met 


Asp 


Thr 


Tyr 


485 


Thr 


Phe 


<210> SEQ ID NO 8 


<211> LENGTH: 


<212> TYPE: PRT 
<213> ORGANISM: 


<400> SEQUENCE: 


Ser Ile Asp Asn 


L 


val 


Leu 


Thr 


65 


Asn 


Asn 


Ile 


Ala 


Tyr 
145 


Lys 


val 


Ile 


Ile 


Pro 


50 


Leu 


Gly 


Thr 


val 


Asp 


130 


val 


Asp 


Thr 


Ala 


Asp 


35 


Thr 


Pro 


Phe 


val 


Leu 


115 


Asn 


Glu 


Gly 


Leu 


Gly 


20 


Gln 


Ser 


Asn 


val 


Lys 


100 


Tyr 


Ala 


Ile 


Ala 


Pro 
180 


338 


Gly 


Val 


Asp 


Asn 


390 


Leu 


His 


Pro 


Gly 


Lys 


470 


Ala 


Phe 


Ser 


83 


Asp 
Lys 
Gly 
375 
Ser 
Pro 
Thr 
Phe 
Val 
455 
Cys 
Asp 


Trp 


Ala 


Leu 


Gln 


360 


Leu 


Leu 


Gly 


Pro 


Phe 


440 


Ala 


Asn 


Phe 


val 


Leu 
520 


Asn 


345 


Phe 


Cys 


Val 


Cys 


Lys 


425 


Phe 


Gln 


Ile 


Val 


Thr 


505 


Gln 


Phe 


Glu 


Met 


Cys 


Asn 


410 


Phe 


Tyr 


Asp 


Gly 


Thr 


490 


Asn 
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Arg 


Tyr 


Phe 


Arg 


395 


Gly 


Asp 


Asp 


Leu 


Gly 


475 


Ser 


Asn 


Infectious bronchitis virus 


8 


Gly 


val 


Asn 


Tle 


85 


val 


Asp 


Val 


Pro 


Asn 
165 


Asn 


Ala 


Met 


Val 


Ala 


Arg 


70 


Trp 


Cys 


Asp 


Leu 


Ser 
150 


Leu 


Thr 


Tyr 


Pro 


Glu 


Phe 


55 


Ile 


Asp 


Ala 


Arg 


Val 
135 
Asn 


Tyr 


Leu 


Asn 


Thr 


Lys 


40 


Glu 


Leu 


Tyr 


Tyr 


Tyr 


120 


Ser 


Leu 


val 


Asn 


Met 


Ile 


25 


Ala 


Leu 


Lys 


Thr 


Thr 


105 


Gly 


Thr 


Leu 


Tyr 


Thr 
185 


Tyr 


10 


Val 


val 


Tyr 


Gly 


Asn 


90 


Asp 


Asp 


Gln 


val 


Lys 
170 


Gln 


Lys 


Thr 


Phe 


Ala 


Leu 


75 


Gln 


Ile 


Tyr 


Cys 
Glin 
155 


Arg 


Gly 


-continued 


Phe 


Asp 


Trp 


380 


Tyr 


Gly 


Arg 


Ser 


Val 


460 


Ala 


Tyr 


Phe 


Gly 


Gly 


Phe 


Lys 


60 


Gly 


Thr 


Glu 


Gln 


Tyr 


140 


Asn 


Val 


Arg 


Tyr 


Tyr 


365 


Asn 


Asp 


Ser 


Thr 


Ser 


445 


Ser 


Val 


Asn 


Asn 


Gly 


Asp 


Asn 


45 


Arg 


Val 


Pro 


Pro 


Ser 


125 


Lys 


Gly 


Asn 


Ser 


Asp 
350 


Asn 


Cys 


Thr 


Leu 


Ser 


430 


Pro 


Leu 


Cys 


Ala 


Pro 
510 


His 


Lys 


30 


Glin 


Asn 


Asp 


Leu 


Ile 


Gly 


Tyr 
190 


Lys 


Glin 


Asn 


Cys 


Ala 


Lys 


Ala 
495 


Tyr 


Tyr 


15 


Val 


Thr 


Ile 


Val 


Tyr 


95 


Gly 


Leu 


Tyr 


Pro 


Ala 


aiehS: 


Glu 


Asn 


His 


Val 


Glu 


Thr 


Lys 


480 


Val 


Asn 


Asp 


Phe 


Ile 


Leu 


Ala 


Ser 


Leu 


160 


Phe 


Thr 


84 


Phe 


Glu 


Leu 


225 


Ile 


val 


Asp 


Asp 


Asn 


305 


Phe 


Leu 


<210> 
<211> 
<212> 
<213> 


<400> 


Glu 


Ser 


210 


Tyr 


Gly 


Thr 


Asn 


Asp 


290 


Lys 


Met 


Gln 


Pro 


195. 


Phe 


Gly 


Met 


Asn 


Gly 


275 


Phe 


Ser 


Thr 


Arg 


val 


Glu 


Cys 


Ser 


260 


Ser 


Leu 


Lys 


Trp 


ORGANISM: 


SEQUENCE : 


Ser Ala Trp Thr 


1 


Ala 


Cys 


val 


65 


Thr 


Asp 


Cys 


Met 


Asn 


145 


Asn 


ser 


Thr 


val 


Cys 


Leu 


Gln 


50 


Met 


val 


Ile 


Asn 


Tyr 


130 


Asn 


Asn 


Trp 


Met 


Gly 
210 


Val 


Pro 


35 


Tyr 


His 


Leu 


val 


Lys 


115 


Thr 


Gly 


Leu 


His 


Phe 
195 


val 


Met 


20 


Ser 


Leu 


Phe 


Lys 


Asp 


100 


Tyr 


Asp 


Asn 


Ala 


Glu 
180 


Cys 


Asn 


Ser Asp 


Glu Lys 


Val Asp 
230 


Arg Leu 
245 


Asp Ser 


Tyr Lys 


Glu Leu 


Val Val 


310 


Phe Glu 
325 


SEQ ID NO 9 
LENGTH: 
TYPE: PRT 


302 


85 


Val 
Tyr 
215 
Lys 
Leu 
Asp 
Gln 
Leu 
295 


Thr 


Asp 


Glu 


200 


Gly 


Pro 


Arg 


val 


Val 


280 


Arg 


val 


Gly 


Arg 


Lys 


Gln 


Ala 


Met 


265 


Cys 


Asn 


Ser 


Ile 


Asp 


Glu 


Leu 


Asn 


250 


Gln 


Thr 


Ile 


Ile 


Ile 
330 
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Phe 


Leu 


Gly 


235 


Lys 


Asn 


Val 


Leu 


Asp 


315 


Lys 


Infectious bronchitis virus 


9 


Cys Gly 
5 


Glu Pro 


Gly Ile 


Ser Lys 


Gly Ala 


70 


Gln Trp 
85 


Tyr Val 


Lys Thr 


Asn Asp 


Asp Asp 


150 
Leu Gly 
165 
Val Leu 


Thr Ala 


Tyr Leu 


Tyr 


Cys 


Met 


Thr 


55 


Gly 


Leu 


ser 


Glu 


ser 


135 


Val 


Gly 


Tyr 


Val 


Gly 
21:5 


Asn 


Asn 


Met 


40 


Thr 


Ser 


Pro 


Asp 


His 


120 


Lys 


Phe 


Ser 


Asp 


Asn 
200 


Ala 


Met 


Ile 


25 


Asn 


Met 


Asp 


Glu 


Ala 


105 


Lys 


Arg 


Ile 


Phe 


Ile 
185 


Ala 


Ser 


Pro 


10 


Pro 


val 


Cys 


Lys 


Gly 


90 


His 


Phe 


Lys 


Tyr 


Ala 
170 
Ala 


Ser 


Glu 


Glu 


Asn 


Ala 


Val 


Gly 


75 


Thr 


Val 


Asp 


His 


Leu 


155 


Val 


Glin 


Ser 


Lys 


-continued 


Leu 


Gly 


220 


Gly 


Leu 


Tyr 


Val 


Lys 


300 


Tyr 


Thr 


Leu 


Tyr 


Lys 


Pro 


60 


Val 


Leu 


ser 


Leu 


Glu 


140 


Ser 


Lys 


Asp 


Ser 


Val 
220 


Asp 


205 


Leu 


Leu 


Asn 


Phe 


Asp 


285 


Glu 


His 


Cys 


Tyr 


Gly 


Tyr 


45 


His 


Ala 


Leu 


Val 


Val 


125 


Gly 


Ser 


Val 


Cys 


Glu 
205 


Lys 


Met 


Glin 


His 


Ala 


Val 


270 


Leu 


Tyr 


Ser 


Tyr 


Lys 


val 


30 


Thr 


Asn 


Pro 


Val 


Leu 


110 


Ile 


Val 


Phe 


Thr 


Ala 
190 


Ala 


Val 


Ser 


His 


Thr 


Lys 


255 


Leu 


Leu 


Gly 


Ile 


Pro 
335 


val 


15 


Gly 


Gln 


Met 


Gly 


Asp 


95 


Ser 


Ser 


Ile 


Leu 


Glu 
175 
Trp 


Phe 


Ser 


Glu 


Ile 


Val 


240 


Ser 


Ala 


Leu 


Thr 


Asn 


320 


Gln 


Gln 


Ile 


Leu 


Arg 


Ser 


80 


Asn 


Asp 


Asp 


Ala 


Arg 


160 


Thr 


Trp 


Leu 


Gly 


86 


Lys 


225 


Glin 


Leu 


Leu 


Gly 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


<400> 


Thr 


Thr 


Lys 


val 


Asn 
290 


Leu 


Ser 


Ala 


Phe 


275 


Thr 


His 


Ala 


Thr 


260 


Asn 


Ser 


PRT 


SEQUENCE : 


Ser Lys Gly His 


1 


Ala 


His 


Asp 


65 


His 


Phe 


Gly 


Gly 


Gln 
145 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


Cys 


Ala 


Asn 


50 


Gln 


Ile 


Lys 


Phe 


Tyr 
130 


Ser 


Gly 


35 


Gly 


Asp 


Ala 


Gly 


Cys 


115 


Gly 


Phe 


20 


Asn 


Ser 


Ser 


His 


Ser 


100 


Leu 


Cys 


PRT 


<400> SEQUENCE: 


Gly 
1 


Thr 


Val 


Thr 


Ala 


Glu 


Ala 


Tyr 


245 


Pro 


Leu 


Phe 


SEQ ID NO 10 
LENGTH: 
TYPE: 
ORGANISM: Artificial Sequence 
FEATURE: 
OTHER INFORMATION: Mutated Nsp10 sequence 


145 


10 


Gln 


Gly 


Tyr 


Leu 


85 


Phe 


Arg 


Gln 


SEQ ID NO 11 
LENGTH : 
TYPE: 
ORGANISM: Artificial Sequence 
FEATURE: 
OTHER INFORMATION: Mutated Nsp14 sequence 


521 


11 


Gly Leu Phe 


Tyr 
Glu 
35 


Lys 


Gly 


Ala 


His 


Cys 


5 


His 


Asn 
230 
Ser 
Val 


Ile 


Thr 


Thr 


val 


Pro 


Phe 


Gly 


70 


Gly 


val 


Asn 


Cys 


Lys 


Thr 


Ala 


Tle 


Asn 


87 


Tyr 
Ile 
val 
Lys 


Ser 
295 


Glu 
Asp 
Leu 
Ala 
55 

Gly 
Gly 
Gln 


Lys 


Asp 
135 


Tle 
Thr 
Leu 
Ser 


55 


Met 


Ile 


Phe 


Asn 


Cys 


280 


Asp 


Glu 


Pro 


Gly 


40 


Ile 


Ala 


Ala 


Ile 


Val 


120 


Ser 


Cys 


Lys 


val 


40 


Leu 


Phe 


Phe 
Asp 
Leu 
265 


Gly 


Ser 


Val 


Ala 


25 


Asn 


Thr 


Ser 


Gly 


Pro 


105 


Cys 


Leu 


Asn 


Ala 


25 


Asn 


Leu 


Ile 


US 10,130,701 B2 


Trp Arg 
235 


Val Ala 
250 
Lys Thr 


Lys Leu 


Phe Val 


Asp Ala 
10 

Asp Thr 
Cys Val 
Ser Lys 
Val Cys 

75 

Asn Leu 
90 

Thr Thr 


Thr Val 


Arg Gln 


Lys Glu 


10 


Leu Ala 


Val Glu 


Gly Phe 


Thr Arg 


-continued 


Asn Cys 


Lys Phe 


Glu Gln 


Leu Val 
285 


Cys Thr 
300 


Val Gly 


Tyr Cys 


Lys Met 
45 


Pro Ser 
60 


Leu Tyr 


Asp Gly 


Glu Lys 


Cys Gln 
125 


Pro Lys 
140 


Phe Ser 


Ala Thr 


Ala Gly 
45 


Lys Met 
60 


Asp Glu 


Asn 


Asp 


Lys 


270 


Arg 


Met 


Ile 


Lys 


30 


Leu 


Pro 


Cys 


Arg 


Asp 


110 


Cys 


Pro 


Gly 


Tyr 


30 


Ser 


Ser 


Ala 


Tyr 
Leu 
255 


Thr 


Asp 


Leu 
15 


Tyr 


Thr 


Thr 


Arg 


Cys 


95 


Pro 


Trp 


Ser 


Val 


15 


Lys 


Glu 


Val 


Ile 


Leu 
240 
Arg 


Asp 


Val 


Ser 


val 


Val 


Pro 


Ala 


80 


Gln 


val 


Ile 


Val 


His 


Val 


Ile 


Asn 


Arg 


88 


65 


Asn 


Gly 


Gly 


Gly 


Phe 


145 


val 


Ser 


Thr 


Gly 


Trp 


225 


Asp 


Leu 


Ile 


Asn 


Ser 


305 


Ala 


Lys 


Pro 


Lys 


Asp 


385 


Leu 


Asn 


Asn 


Thr 


Lys 
465 


His 


val 


Thr 


Ala 


Asn 


130 


Asn 


Arg 


Asp 


Leu 


Ser 


210 


Lys 


Ile 


His 


Met 


Trp 


290 


Ser 


Leu 


Cys 


Ile 


Asp 


370 


Cys 


Ser 


Lys 


Leu 


Ile 
450 


Asp 


Ala 


Arg 


Asn 


Asp 


glia Eis 


Asn 


His 


Pro 


Cys 


Arg 


195 


Arg 


His 


Gln 


Cys 


Thr 


275 


Asp 


Cys 


Lys 


val 


val 


355 


Lys 


Tyr 


val 


His 


Lys 


435 


Gln 


Cys 


Gln 


Gly 


Ile 


100 


Phe 


Phe 


Leu 


Arg 


val 


180 


Tyr 


Ala 


Cys 


Gln 


Asn 


260 


Arg 


Leu 


Arg 


Val 


Arg 


340 


Pro 


Phe 


Pro 


Phe 


Ala 


420 


Ala 


Leu 


Ile 


Met 


Tp: 


85 


Gly 


Val 


Glu 


Arg 


Ile 


165 


Val 


Phe 


Thr 


Leu 


Trp 


245 


val 


Cys 


Thr 


Tyr 


Asn 


325 


Arg 


Asn 


Ala 


Asp 


Asn 


405 


Phe 


Met 


Asp 


Thr 


Tyr 
485 


70 


Val 


Thr 


Val 


Pro 


Ala 


150 


Val 


Phe 


val 


Thr 


Gly 


230 


Gly 


His 


Leu 


Tyr 


Leu 


310 


val 


Gly 


val 


Asp 


Asn 


390 


Leu 


His 


Pro 


Gly 


Lys 


470 


Ala 


89 


Gly 
Asn 
Thr 
Val 
135 
Leu 
Gln 
Val 
Lys 
Phe 
215 
Phe 
Tyr 
Gly 
Ala 
Pro 
295 
Gln 
val 
Asp 
Lys 
Gly 
375 
ser 
Pro 
Thr 


Phe 


Val 
455 


Cys 


Asp 


Phe 


Leu 


Pro 


120 


Asn 


Phe 


Met 


Thr 


Ile 


200 


Asn 


Asp 


Ser 


His 


Ile 


280 


His 


Arg 


Tyr 


Leu 


Gln 


360 


Leu 


Leu 


Gly 


Pro 


Phe 
440 
Ala 


Asn 


Phe 


Asp 


Pro 


105 


Glu 


Ser 


Lys 


Leu 


Trp 


185 


Gly 


Ser 


Phe 


Gly 


Ala 


265 


Asn 


Ile 


Met 


Asp 


Asn 


345 


Phe 


Cys 


Leu 


Cys 


Lys 


425 


Phe 


Glin 


Ile 


val 


val 


90 


Phe 


Gly 


Lys 


Ser 


Ala 
170 


Cys 


Lys 


His 


val 


Asn 


250 


His 


Asn 


Ala 


Tyr 


Ile 


330 


Phe 


Glu 


Met 


Cys 


Asn 


410 


Phe 


Tyr 


Asp 


Gly 


Thr 
490 
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75 


Glu 


Glin 


Leu 


Ala 


Ala 


155 


Asp 


His 


Asp 


Thr 


Tyr 


235 


Leu 


Val 


Ala 


Asn 


Leu 


315 


Gly 


Arg 


Tyr 


Phe 


Arg 


395 


Gly 


Asp 


Asp 


Leu 


Gly 
475 


Ser 


-continued 


Ala 


Val 


Val 


Pro 


140 


Lys 


Asn 


Gly 


Gln 


Gln 


220 


Asn 


Glin 


Ala 


Phe 


Glu 


300 


Asn 


Asn 


Phe 


Asp 


Trp 


380 


Tyr 


Gly 


Arg 


Ser 


Val 
460 


Ala 


Tyr 


Thr 


Gly 


Asp 


125 


Pro 


Pro 


Leu 


Leu 


val 


205 


Ala 


Pro 


Phe 


Ser 


Cys 


285 


Asp 


Ala 


Pro 


Tyr 


Tyr 


365 


Asn 


Asp 


ser 


Thr 


Ser 
445 
Ser 


Val 


Asn 


His 


Phe 


110 


Thr 


Gly 


Trp 


Cys 


Glu 
190 


Cys 


Tyr 


Leu 


Asn 


Ala 


270 


Gln 


Glu 


Cys 


Lys 


Asp 


350 


Asn 


Cys 


Thr 


Leu 


Ser 


430 


Pro 


Leu 


Cys 


Ala 


Ala 


95 


Ser 


Ser 


Glu 


His 


Asn 


175 


Leu 


Ser 


Ala 


Leu 


His 


255 


Asp 


Asp 


Val 


val 


Gly 


335 


Lys 


Gln 


Asn 


Cys 


Ala 


Lys 


Ala 
495 


80 


Cys 


Thr 


Ile 


Gln 


Val 


160 


Val 


Thr 


Cys 


Cys 


val 


240 


Asp 


Ala 


Val 


Asn 


Asp 


320 


Ile 


Asn 


His 


Val 


Glu 


Thr 


Lys 


480 


Val 


90 


91 


US 10,130,701 B2 


-continued 


Thr Ala Gly Phe Thr Phe Trp Val Thr Asn Asn Phe Asn Pro Tyr Asn 


500 


505 


Leu Trp Lys Ser Phe Ser Ala Leu Gln 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


515 


PRT 


<400> SEQUENCE: 


Ser 


1 


val 


Leu 


Thr 


65 


Asn 


Asn 


Ile 


Ala 


Tyr 


145 


Lys 


val 


Phe 


Glu 


Leu 


225 


Tle 


val 


Asp 


Asp 


Asn 


305 


Phe 


Ile 


Ile 


Tle 


Pro 


50 


Leu 


Gly 


Thr 


Val 


Asp 


130 


Val 


Asp 


Thr 


Glu 


Ser 


210 


Tyr 


Gly 


Thr 


Asn 


Asp 
290 


Lys 


Met 


SEQ ID NO 12 
LENGTH: 
TYPE: 
ORGANISM: Artificial Sequence 
FEATURE: 
OTHER INFORMATION: Mutated Nsp15 sequence 


338 


12 


Asp Asn Ile 


Ala 


Asp 


35 


Thr 


Pro 


Phe 


Val 


Leu 


115 


Asn 


Glu 


Gly 


Leu 


Pro 


195 


Phe 


Gly 


Met 


Asn 


Gly 


275 


Phe 


Ser 


Thr 


Gly 


20 


Gln 


Ser 


Asn 


Val 


Lys 


100 


Tyr 


Ala 


Ile 


Ala 


Pro 


180 


Arg 


val 


Glu 


Cys 


Ser 


260 


Ser 


Leu 


Lys 


Trp 


5 


Gly 


val 


Asn 


Ile 


85 


val 


Asp 


val 


Pro 


Asn 


165 


Asn 


ser 


Glu 


Val 


Arg 


245 


Asp 


Tyr 


Glu 


Val 


Phe 
325 


Ala 


Met 


Val 


Ala 


Arg 


70 


Trp 


Cys 


Asp 


Leu 


Ser 


150 


Leu 


Thr 


Asp 


Lys 


Asp 


230 


Leu 


Ser 


Lys 


Leu 


Val 


310 


Glu 


Tyr 


Pro 


Glu 


Phe 


55 


Tle 


Asp 


Ala 


Arg 


val 


135 


Asn 


Tyr 


Tle 


Val 


Tyr 


215 


Lys 


Leu 


Asp 


Gln 


Leu 
295 


Thr 


Asp 


520 


Asn 


Thr 


Leu 


Tyr 


Tyr 


Tyr 


120 


Ser 


Leu 


Val 


Asn 


Glu 


200 


Gly 


Pro 


Arg 


val 


Val 
280 
Arg 


Val 


Gly 


Met 


Ile 


25 


Ala 


Leu 


Lys 


Thr 


Thr 


105 


Gly 


Thr 


Leu 


Tyr 


Thr 


185 


Arg 


Lys 


Gln 


Ala 


Met 


265 


Cys 


Asn 


Ser 


Ile 


Tyr 


10 


Val 


Val 


Tyr 


Gly 


Asn 


90 


Asp 


Asp 


Gln 


val 


Lys 


170 


Gln 


Asp 


Glu 


Leu 


Asn 


250 


Gln 


Thr 


Ile 


Ile 


Ile 
330 


Lys 


Thr 


Phe 


Ala 


Leu 


75 


Gln 


Ile 


Tyr 


Cys 


Gln 


155 


Arg 


Gly 


Phe 


Leu 


Gly 


235 


Lys 


Asn 


Val 


Leu 


Asp 
315 


Lys 


Gly 


Gly 


Phe 


Lys 


60 


Gly 


Thr 


Glu 


Gln 


Tyr 


140 


Asn 


val 


Arg 


Leu 


Gly 


220 


Gly 


Leu 


Tyr 


Val 


Lys 
300 


Tyr 


Thr 


Gly 


Asp 


Asn 


45 


Arg 


val 


Pro 


Pro 


Ser 


125 


Lys 


Gly 


Asn 


Ser 


Asp 


205 


Leu 


Leu 


Asn 


Phe 


Asp 


285 


Glu 


His 


Cys 


510 


His 


Lys 


30 


Gln 


Asn 


Asp 


Leu 


Asn 


110 


Phe 


Arg 


Ile 


Gly 


Tyr 


190 


Met 


Glin 


His 


Ala 


Val 
270 


Leu 


Tyr 


Ser 


Tyr 


Tyr 
15 

Val 
Thr 


Ile 


val 


Tyr 


95 


Gly 


Leu 


Tyr 


Pro 


Ala 


175 


Glu 


Ser 


His 


Thr 


Lys 


255 


Leu 


Leu 


Gly 


Ile 


Pro 
335 


Asp 


Phe 


Ile 


Arg 


Thr 


80 


Arg 


Leu 


Ala 


Ser 


Leu 


160 


Phe 


Thr 


Glu 


Ile 


Val 


240 


Ser 


Ala 


Leu 


Thr 


Asn 
320 


Glin 


92 


Leu 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


Gln 


PRT 


<400> SEQUENCE: 


Ser 


1 


Ala 


Cys 


val 


65 


Thr 


Asp 


Cys 


Met 


Asn 


145 


Asn 


Ser 


Thr 


Tle 


Lys 


225 


Glin 


Leu 


Leu 


Gly 


Ala 


Cys 


Leu 


Gln 


50 


Met 


Val 


Ile 


Asn 


Tyr 


130 


Asn 


Asn 


Trp 


Met 


Gly 


210 


Thr 


Thr 


Lys 


val 


Asn 
290 


Trp 


val 


Pro 


35 


Tyr 


His 


Leu 


Val 


Lys 


115 


Thr 


Gly 


Leu 


His 


Phe 


195 


val 


Leu 


Ser 


Ala 


Phe 


275 


Thr 


Thr 


Met 


20 


Ser 


Leu 


Phe 


Lys 


Asp 


100 


Tyr 


Asp 


Asn 


Ala 


Glu 


180 


Cys 


Asn 


His 


Ala 


Thr 


260 


Asn 


Ser 


SEQ ID NO 13 
LENGTH: 
TYPE: 
ORGANISM: Artificial Sequence 
FEATURE: 
OTHER INFORMATION: Mutated Nsp16é sequence 


302 


13 


Gly 


Ser 


Gly 


Gln 


85 


Tyr 


Lys 


Asn 


Asp 


Leu 


165 


val 


Thr 


Tyr 


Ala 


Tyr 


245 


Pro 


Leu 


Phe 


Gly 


Pro 


Ile 


Lys 


Ala 


70 


Trp 


val 


Thr 


Asp 


Asp 


150 


Gly 


Leu 


Ala 


Leu 


Asn 


230 


ser 


val 


Tle 


Thr 


93 


Tyr 
Cys 
Met 
Thr 
55 

Gly 
Leu 
Ser 
Glu 
Ser 
135 
val 
Gly 
Tyr 
Val 
Gly 
215 
Tyr 
Ile 
Val 


Lys 


ser 
295 


The invention claimed is: 


1. A live, attenuated coronavirus comprising a variant 
replicase gene encoding polyproteins comprising a mutation 
in one or both of non-structural protein(s) nsp-10 and 
nsp-14, wherein the variant replicase gene encodes a protein 


Asn 


Asn 


Met 


40 


Thr 


Ser 


Pro 


Asp 


His 


120 


Lys 


Phe 


Ser 


Asp 


Asn 


200 


Ala 


Ile 


Phe 


Asn 


Cys 


280 


Asp 


Met 


Ile 


25 


Asn 


Met 


Asp 


Glu 


Ala 


105 


Lys 


Arg 


Ile 


Phe 


Ile 


185 


Ala 


Ser 


Phe 


Asp 


Leu 


265 


Gly 


Ser 


Pro 


10 


Pro 


Val 


Cys 


Lys 


Gly 


90 


His 


Phe 


Lys 


Tyr 


Ala 


170 


Ala 


Ser 


Glu 


Trp 


val 


250 


Lys 


Lys 


Phe 
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Glu 


Asn 


Ala 


Val 


Gly 


75 


Thr 


Val 


Asp 


His 


Leu 


155 


Val 


Gln 


Ser 


Lys 


Arg 


235 


Ala 


Thr 


Leu 


Val 


94 


-continued 


Leu 


Tyr 


Lys 


Pro 


60 


Val 


Leu 


Ser 


Leu 


Glu 


140 


Ser 


Lys 


Asp 


Ser 


Val 


220 


Asn 


Lys 


Glu 


Leu 


Cys 
300 


Tyr 


Gly 


Tyr 


45 


His 


Ala 


Leu 


val 


val 


125 


Gly 


Ser 


val 


Cys 


Glu 


205 


Lys 


Cys 


Phe 


Gln 


Val 


285 


Thr 


Lys Val Gln 
15 


Val Gly Ile 
30 


Thr Gln Leu 
Asn Met Arg 
Pro Gly Ser 


80 


Val Asp Asn 
95 


Leu Ser Asp 
110 


Ile Ser Asp 
Val Ile Ala 
Phe Leu Arg 


160 


Thr Glu Thr 
175 


Ala Trp Trp 
190 


Ala Phe Leu 
Val Ser Gly 
Asn Tyr Leu 


240 


Asp Leu Arg 
255 


Lys Thr Asp 
270 


Arg Asp Val 


Met 


60 Comprising an amino acid mutation of Val to Leu at the 
position corresponding to position 393 of SEQ ID NO: 7. 
2. The coronavirus according to claim 1 wherein the 
variant replicase gene encodes a protein comprising one or 
more amino acid mutations selected from: 


comprising an amino acid mutation of Pro to Leu at the 65 


position corresponding to position 85 of SEQ ID NO: 6, 
and/or wherein the variant replicase gene encodes a protein 


an amino acid mutation of Leu to Ile at the position 
corresponding to position 183 of SEQ ID NO: 8; and 


US 10,130,701 B2 


95 
an amino acid mutation of Val to Ile at the position 
corresponding to position 209 of SEQ ID NO: 9. 

3. The coronavirus according to claim 1 wherein the 
replicase gene encodes a protein comprising the amino acid 
mutations Val to Leu at the position corresponding to 
position 393 of SEQ ID NO: 7; Leu to Ile at the position 
corresponding to position 183 of SEQ ID NO: 8; and Val to 
Ile at the position corresponding to position 209 of SEQ ID 
NO: 9. 

4. The coronavirus according to claim 1 wherein the 
replicase gene encodes a protein comprising the amino acid 
mutations Pro to Leu at the position corresponding to 
position 85 of SEQ ID NO: 6; Val to Leu at the position 
corresponding to position 393 of SEQ ID NO: 7; Leu to Ile 
at the position corresponding to position 183 of SEQ ID NO: 
8; and Val to Ie at the position corresponding to position 209 
of SEQ ID NO: 9. 

5. The coronavirus according to claim 1 wherein the 
replicase gene comprises at least one nucleotide substitu- 
tions selected from: 

C to Tat nucleotide position 12137; and 

G to C at nucleotide position 18114; 

compared to the sequence shown as SEQ ID NO: 1; 

and optionally, comprises one or more nucleotide substi- 

tutions selected from T to A at nucleotide position 
19047; and 

G to A at nucleotide position 20139; 
compared to the sequence shown as SEQ ID NO: 1. 

6. The coronavirus according to claim 1 which is an 
infectious bronchitis virus (IBV). 

7. The coronavirus according to claim 1 which is IBV 
M41. 

8. The coronavirus according to claim 7, which comprises 
an S protein at least, part of which is from an IBV serotype 
other than M41. 

9. The coronavirus according to claim 8, wherein the S1 
subunit is from an IBV serotype other than M41. 

10. The coronavirus according to claim 8, wherein the S 
protein is from an IBV serotype other than M41. 

11. The coronavirus according to claim 1 which has 
reduced pathogenicity compared to a coronavirus expressing 
a corresponding wild-type replicase, wherein the virus is 
capable of replicating without being pathogenic to the 
embryo when administered to an embryonated egg. 


S} 


10 


15 


20 


25 


30 


35 


40 


96 


12. A variant replicase gene as defined in claim 1. 

13. A protein encoded by a variant coronavirus replicase 
gene according to claim 12. 

14. A plasmid comprising a replicase gene according to 
claim 12. 

15. A method for making the coronavirus according to 
claim 1 which comprises the following steps: 

(i) transfecting a plasmid according to claim 14 into a host 

cell; 

(ii) infecting the host cell with a recombining virus 
comprising the genome of a coronavirus strain with a 
replicase gene; 

(iii) allowing homologous recombination to occur 
between the replicase gene sequences in the plasmid 
and the corresponding sequences in the recombining 
virus genome to produce a modified replicase gene; and 

(iv) selecting for recombining virus comprising the modi- 
fied replicase gene. 

16. The method according to claim 15, wherein the 

recombining virus is a vaccinia virus. 

17. The method according to claim 15 which also includes 
the step: 

(v) recovering recombinant coronavirus comprising the 
modified replicase gene from the DNA from the recom- 
bining virus from step (iv). 

18. A cell capable of producing a coronavirus according 

to claim 1. 

19. A vaccine comprising a coronavirus according to 
claim 1 and a pharmaceutically acceptable carrier. 

20. A method for treating and/or preventing a disease in 
a subject which comprises the step of administering a 
vaccine according to claim 19 to the subject. 

21. The method of claim 20, wherein the disease is 
infectious bronchitis (IB). 

22. The method according to claim 20 wherein the method 
of administration is selected from the group consisting of; 
eye drop administration, intranasal administration, drinking 
water administration, post-hatch injection and in ovo injec- 
tion. 

23. The method according to claim 21 wherein the admin- 
istration is in Ovo vaccination. 

24. A method for producing a vaccine according to claim 
19, which comprises the step of infecting a cell according to 
claim 18 with a coronavirus according to claim 1. 

25. The coronavirus according to claim 1, further com- 
prising a mutation in one or both of nsp-15 and nsp-16. 


* * * * * 


